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FIELD OF THE INVENTION 

The present invention relates to isolated polynucleotides that represent a 
complete gene, or a fragment thereof, that is expressed. In addition, the present 
invention relates to the polypeptide or protein corresponding to the coding sequence 
of these polynucleotides. The present invention also relates to isolated 
polynucleotides that represent regulatory regions of genes. The present invention also 
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relates to isolated polynucleotides that represent untranslated regions of genes. The 
present invention further relates to the use of these isolated polynucleotides and 
polypeptides and proteins. 

DESCRIPTION OF THE RELATED ART 

Efforts to map and sequence the genome of a number of organisms are in 
progress; a few complete genome sequences, for example those of E. coli and 
Saccharomyces cerevisiae are known (Blattner et al., Science 2ZZ:1453 (1997); Goffeau 
et al., Science 224:546 (1996)). The complete genome of a multicellular organism, C. 
elegans, has also been sequenced (See, the C. elegans Sequencing Consortium, Science 
282 :2012 (1998)). To date, no complete genome of a plant has been sequenced, nor has 
a complete cDNA complement of any plant been sequenced. 
SUMMARY OF THE INVENTION 

The present invention comprises polynucleotides, such as complete cDNA 
sequences and/or sequences of genomic DNA encompassing complete genes, fragments 
of genes, and/or regulatory elements of genes and/or regions with other functions and/or 
intergenic regions, hereinafter collectively referred to as Sequence-Determined DNA 
Fragments (SDFs), from different plant species, particularly corn, wheat, soybean, rice 
and Arabidopsis thaliana, and other plants and or mutants, variants, fragments or 
fusions of said SDFs and polypeptides or proteins derived therefrom. In some instances, 
the SDFs span the entirety of a protein-coding segment. In some instances, the entirety 
of an mRNA is represented. Other objects of the invention that are also represented by 
SDFs of the invention are control sequences, such as, but not limited to, promoters. 
Complements of any sequence of the invention are also considered part of the invention. 

Other objects of the invention are polynucleotides comprising exon sequences, 
polynucleotides comprising intron sequences, polynucleotides comprising introns 
together with exons, intron/exon junction sequences, 5' untranslated sequences, and 3' 
untranslated sequences of the SDFs of the present invention. Polynucleotides 
representing the joinder of any exons described herein, in any arrangement, for example, 
to produce a sequence encoding any desirable amino acid sequence are within the scope 
of the invention. 

The present invention also resides in probes useful for isolating and identifying 
nucleic acids that hybridize to an SDF of the invention. The probes can be of any length, 
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but more typically are 12-2000 nucleotides in length; more typically, 15 to 200 
nucleotides long; even more typically, 18 to 100 nucleotides long. 

Yet another object of the invention is a method of isolating and/or identifying 

nucleic acids using the following steps: 

(a) contacting a probe of the instant invention with a polynucleotide sample 
under conditions that permit hybridization and formation of a polynucleotide duplex; and 

(b) detecting and/or isolating the duplex of step (a). 

The conditions for hybridization can be from low to moderate to high stringency 
conditions. The sample can include a polynucleotide having a sequence unique in a 
plant genome. Probes and methods of the invention are useful, for example, without 
limitation, for mapping of genetic traits and/or for positional cloning of a desired 

fragment of genomic DNA. 

Probes and methods of the invention can also be used for detecting alternatively 
spliced messages within a species. Probes and methods of the invention can further be 
used to detect or isolate related genes in other plant species using genomic DNA 
(gDNA) and/or cDNA libraries. In some instances, especially when longer probes and 
low to moderate stringency hybridization conditions are used; the probe will hybridize to 
a plurality of cDNA and/or gDNA sequences of a plant. This approach is useful for 
isolating representatives of gene families which are identifiable by possession of a 
common functional domain in the gene product or which have common cis-acting 
regulatory sequences. This approach is also useful for identifying orthologous genes 

from other organisms. 

The present invention also resides in constructs for modulating the expression 
of the genes comprised of all or a fragment of an SDF. The constructs comprise all or 
a fragment of the expressed SDF, or of a complementary sequence. Examples of 
constructs include ribozymes comprising RNA encoded by an SDF or by a sequence 
complementary thereto, antisense constructs, constructs comprising coding regions or 
parts thereof, constructs comprising promoters, introns, untranslated regions, scaffold 
attachment regions, methylating regions, enhancing or reducing regions, DNA and 
chromatin conformation modifying sequences, etc. Such constructs can be 
constructed using viral, plasmid, bacterial artificial chromosomes (BACs), plasmid 
artificial chromosomes (PACs), autonomous plant plasmids, plant artificial 
chromosomes or other types of vectors and exist in the plant as autonomous 
replicating sequences or as DNA integrated into the genome. When inserted into a 
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host cell the construct is, preferably, functionally integrated with, or operatively 
linked to, a heterologous polynucleotide. For instance, a coding region from an SDF 
might be operably linked to a promoter that is functional in a plant. 

The present invention also resides in host cells, including bacterial or yeast cells 
or plant cells, and plants that harbor constructs such as described above. Another aspect 
of the invention relates to methods for modulating expression of specific genes in plants 
by expression of the coding sequence of the constructs, by regulation of expression of 
one or more endogenous genes in a plant or by suppression of expression of the 
polynucleotides of the invention in a plant. Methods of modulation of gene expression 
include without limitation (1) inserting into a host cell additional copies of a 
polynucleotide comprising a coding sequence; (2) modulating an endogenous promoter 
in a host cell; (3) inserting antisense or ribozyme constructs into a host cell and (4) 
inserting into a host cell a polynucleotide comprising a sequence encoding a variant , 
fragment, or fusion of the native polypeptides of the instant invention. 
BRIEF DESCRIPTION OF THE TABLES 

The sequences of exemplary SDFs and polypeptides corresponding to the 
coding sequences of the instant invention are described in any of the Reference Tables 
1-1, 1-2, etc. (hereinafter referred to as the REF Tables"); and in any of the Sequence 
Tables 2-1, 2-2, etc. (hereinafter referred to as the SEQ Tables"). The REF Tables 
refer to a number of Maximum Length Sequences" or MLS." Each MLS 
corresponds to the longest cDNA obtained, either by cloning or by the prediction from 
genomic sequence. The sequence of the MLS is the cDNA sequence as described in 
the Av subsection of the REF Tables. 

The REF Tables include the following information relating to each MLS: 

I. cDNA Sequence 

A. 5' UTR 

B. Coding Sequence 
c. y UTR 

II. Genomic Sequence 

A. Exons 

B. Introns 

C. Promoters 

III. Link of cDNA Sequences to Clone IDs 

IV. Multiple Transcription Start Sites 
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V. Polypeptide Sequences 

A. Signal Peptide 

B. Domains 

C. Related Polypeptides 

VI. Related Polynucleotide Sequences 
L cDNA SEQUENCE 

The REF Tables indicate which sequence in The SEQ Tables represent the 
sequence of each MLS. The MLS sequence can comprise 5 5 and 3' UTR as well as 
coding sequences. In addition, specific cDNA clone numbers also are included in The 
REF Tables when the MLS sequence relates to a specific cDNA clone. 

A. 5' UTR 

The location of the 5' UTR can be determined by comparing the most 5' MLS 
sequence with the corresponding genomic sequence as indicated in The REF Tables. 
The sequence that matches, beginning at any of the transcriptional start sites and 
ending at the last nucleotide before any of the translational start sites corresponds to 
the 5' UTR. 

B. Coding Region 

The coding region is the sequence in any open reading frame found in the 
MLS. Coding regions of interest are indicated in the PolyP SEQ subsection of The 
REF Tables. 

C. 3 5 UTR 

The location of the 3 5 UTR can be determined by comparing the most 3' MLS 
sequence with the corresponding genomic sequence as indicated in The REF Tables. 
The sequence that matches, beginning at the translational stop site and ending at the 
last nucleotide of the MLS corresponds to the 3 ? UTR. 
IL GENOMIC SEQUENCE 

Further, the REF Tables indicate the specific gi" number of the genomic 
sequence if the sequence resides in a public databank. For each genomic sequence, 
the REF Tables indicate which regions are included in the MLS. These regions can 
include the 5' and 3' UTRs as well as the coding sequence of the MLS. See, for 
example, the scheme below: 



Region 1 
I 5 f UTR | Exon | 



Region 2 

■- j Exon [ 



Region 3 

j Exon | 3 1 UTR | — 
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I 

Promoter 



Trans lational 
Start Site 



Intron 



I 

Intron 



I 

Stop Codon 



The REF Tables report the first and last base of each region that are included 
in an MLS sequence. An example is shown below: 
gi No. 47000: 
37102 ... 37497 
37593 ... 37925 

The numbers indicate that the MLS contains the following sequences from two 
regions of gi No. 47000; a first region including bases 37102-37497, and a second 
region including bases 37593-37925. 

A, EXON SEQUENCES 

The location of the exons can be determined by comparing the sequence of the 
regions from the genomic sequences with the corresponding MLS sequence as 
indicated by the REF Tables. 

i. INITIAL EXON 

To determine the location of the initial exon, information from the 

(1) polypeptide sequence section; 

(2) cDNA polynucleotide section; and 

(3) the genomic sequence section 

of the REF Tables are used. First, the polypeptide section will indicate where the 
translational start site is located in the MLS sequence. The MLS sequence can be 
matched to the genomic sequence that corresponds to the MLS. Based on the match 
between the MLS and corresponding genomic sequences , the location of the 
translational start site can be determined in one of the regions of the genomic 
sequence. The location of this translational start site is the start of the first exon. 

Generally, the last base of the exon of the corresponding genomic region, in 
which the translational start site was located, will represent the end of the initial exon. 
In some cases, the initial exon will end with a stop codon, when the initial exon is the 
only exon. 
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In the case when sequences representing the MLS are in the positive strand of 
the corresponding genomic sequence, the last base will be a larger number than the 
first base. When the sequences representing the MLS are in the negative strand of the 
corresponding genomic sequence, then the last base will be a smaller number than the 
first base. 



ii. INTERNAL EXONS 

Except for the regions that comprise the 5 ? and 3' UTRs, initial exon, and 
terminal exon, the remaining genomic regions that match the MLS sequence are the 
internal exons. Specifically, the bases defining the boundaries of the remaining 
regions also define the intron/exon junctions of the internal exons. 

iii. TERMINAL EXON 

As with the initial exon, the location of the terminal exon is determined with 
information from the 

(1) polypeptide sequence section; 

(2) cDNA polynucleotide section; and 

(3) the genomic sequence section 

of the REF Tables. The polypeptide section will indicate where the stop codon is 
located in the MLS sequence. The MLS sequence can be matched to the 
corresponding genomic sequence. Based on the match between MLS and 
corresponding genomic sequences, the location of the stop codon can be determined 
in one of the regions of the genomic sequence. The location of this stop codon is the 
end of the terminal exon. Generally, the first base of the exon of the corresponding 
genomic region that matches the cDNA sequence, in which the stop codon was 
located, will represent the beginning of the terminal exon. In some cases, the 
translational start site will represent the start of the terminal exon, which will be the 
only exon. 

In the case when the MLS sequences are in the positive strand of the 
corresponding genomic sequence, the last base will be a larger number than the first 
base. When the MLS sequences are in the negative strand of the corresponding 
genomic sequence, then the last base will be a smaller number than the first base. 

B. INTRON SEQUENCES 
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In addition, the introns corresponding to the MLS are defined by identifying 
the genomic sequence located between the regions where the genomic sequence 
comprises exons. Thus, introns are defined as starting one base downstream of a 
genomic region comprising an exon, and end one base upstream from a genomic 
region comprising an exon. 

C. PROMOTER SEQUENCES 

As indicated below, promoter sequences corresponding to the MLS are 
defined as sequences upstream of the first exon; more usually, as sequences upstream 
of the first of multiple transcription start sites; even more usually as sequences about 
2,000 nucleotides upstream of the first of multiple transcription start sites. 

11L LINK of cDNA SEQUENCES to CLONE IDs 

As noted above, the REF Tables identify the cDNA clone(s) that relate to each 
MLS. The MLS sequence can be longer than the sequences included in the cDNA 
clones. In such a case, the REF Tables indicate the region of the MLS that is included 
in the clone. If either the 5 ' or 3' termini of the cDNA clone sequence is the same as 
the MLS sequence, no mention will be made. 
IV. Multiple Transcription Start Sites 

Initiation of transcription can occur at a number of sites of the gene. The REF 
Tables indicate the possible multiple transcription sites for each gene. In the REF 
Tables, the location of the transcription start sites can be either a positive or negative 
number. 

The positions indicated by positive numbers refer to the transcription start sites as 
located in the MLS sequence. The negative numbers indicate the transcription start 
site within the genomic sequence that corresponds to the MLS. 

To determine the location of the transcription start sites with the negative 
numbers, the MLS sequence is aligned with the corresponding genomic sequence. In 
the instances when a public genomic sequence is referenced, the relevant 
corresponding genomic sequence can be found by direct reference to the nucleotide 
sequence indicated by the gi" number shown in the public genomic DNA section of 
the REF Tables. When the position is a negative number, the transcription start site is 
located in the corresponding genomic sequence upstream of the base that matches the 
beginning of the MLS sequence in the alignment. The negative number is relative to 
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the first base of the MLS sequence which matches the genomic sequence 
corresponding to the relevant gi" number. 

In the instances when no public genomic DNA is referenced, the relevant 
nucleotide sequence for alignment is the nucleotide sequence associated with the 
amino acid sequence designated by gi" number of the later PolyP SEQ subsection. 
V. P olypeptide Sequences 

The PolyP SEQ subsection lists SEQ ID NOs and Ceres SEQ ID NO for 
polypeptide sequences corresponding to the coding sequence of the MLS sequence 
and the location of the translational start site with the coding sequence of the MLS 
sequence. 

The MLS sequence can have multiple translational start sites and can be 
capable of producing more than one polypeptide sequence. 

A. Signal Peptide 

The REF Tables also indicate in subsection (B) the cleavage site of the 
putative signal peptide of the polypeptide corresponding to the coding sequence of the 
MLS sequence. Typically, signal peptide coding sequences comprise a sequence 
encoding the first residue of the polypeptide to the cleavage site residue. 

B. Domains 

Subsection (C) provides information regarding identified domains (where 
present) within the polypeptide and (where present) a name for the polypeptide 
domain. 

C. Related Polypeptides 

Subsection (Dp) provides (where present) information concerning amino acid 
sequences that are found to be related and have some percentage of sequence identity 
to the polypeptide sequences of the REF Tables and the SEQ Tables. These related 
sequences are identified by a gi" number. 
YL Related Polynucleotide Sequences 



Subsection (Dn) provides polynucleotide sequences (where present) that are 
related to and have some percentage of sequence identity to the MLS or 
corresponding genomic sequence. 



Abbreviation 


Description 


Max Len. Seq. 


Maximum Length Sequence 


rel to 


Related to 


Clone Ids 


Clone ID numbers 


Pub gDNA 


Public Genomic DNA 
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Abbreviation 


Description 


gi No. 


gi number 


Gen. seq. in cDNA 


Genomic Sequence in cDNA 

(Each region for a single gene prediction is 

listed on a separate line. 

xxi taac ui iiiuiupic gene preaiCTions, tne 

srouo of refnons rplatino tn a QincrlA nrf*Hir»ti^rk 

are separated by a blank line) 


(Ac) cDNA SEQ 


cDNA sequence 


- Pat. Appln. SEO ID NO 


Patent Application SEQ ID NO: 


- Ceres SEQ ID NO: 1673877 


Ceres SEQ ID NO: 


- SEQ # w. TSS 


T or*ation \xnthin tYie* r > Y^\J A oonnonnQ cca tt*\ 

i^u^auuu vviiiim me sequence, otivj ID 
NO:, of Transcription Start Sites which are 
listed helnw 


- Clone ID #:#-># 


Clone ID comprises hasps U tn U nf the* r>F>>j a 

■v-'avxxw iiy wuhi^iiicjwo L/aoto ft l\J ft \JL lilts i,-' ■# |^ f~\ 

Sequence 


PolyP SEQ 


Polypeptide Seauence 


- Pat. Appln. SEQ ID NO: 


Patent Application SEQ ID NO: 


- Ceres SEQ ID NO 


Ceres SEO ID NO- 


- Loc. SEQ ID NO: @ nt. 


Location of translational start site in cDNA of 

SEO ID NO' at rmrlentiHp nnmhpr 


(C) Pred. PP Nom. & Annot. 


Nomination and Annotation nf Dnmiiinc within 
Predicted Polypeptide(s) 


- (Title) 


Name of Domain 


- Loc. SEQ ID NO #:#-># aa. 


1 J or*at , ion nf trip HmnQin nrithin tha nnitTnan^'rin 

j-^^v^aiiwii ui me Liuiiiaiii wiimn tne poiypeptioe 

Of SEO ID NO' from U to U amino aoirl 
WJ - vJJ-JVi j. J-/ i ^ w. -Li will tt l(J tt amino a CI CI 

residues. 


(Dp) Rel. AA SEQ 


Related Amino Acid Seauenres 


- Align. NO 


Alignment number 


- gi No 


Gi number 


- Desp. 


Description 


- % Idnt. 


Percent identity 


- Align. Len. 


Alignment Length 

a. e> 


- Loc. SEQ ID NO: # -> # aa 


Location within SEQ ID NO: from # to # 
amino acid residue. 



DETAILED DESCRIPTION OF THE INVENTION 



The invention relates to (I) polynucleotides and methods of use thereof, such as 

IA. Probes, Primers and Substrates; 

IB. Methods of Detection and Isolation; 
B.l. Hybridization; 

B.2. Methods of Mapping; 

B.3. Southern Blotting; 

B.4. Isolating cDNA from Related Organisms; 

B.5. Isolating and/or Identifying Orthologous Genes 

IC. Methods of Inhibiting Gene Expression 
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C.l. Antisense 

C.2. Ribozyme Constructs; 

C.3. Chimeraplasts; 

C.4 Co-Suppression; 

C.5. Transcriptional Silencing 

C.6. Other Methods to Inhibit Gene Expression 

ID. Methods of Functional Analysis; 

IE. Promoter Sequences and Their Use; 

IF. UTRs and/or Intron Sequences and Their Use; and 

IG. Coding Sequences and Their Use. 

The invention also relates to (II) polypeptides and proteins and methods of use 
thereof, such as IIA. Native Polypeptides and Proteins 

A.1 Antibodies 

A. 2 In Vitro Applications 

IIB. Polypeptide Variants, Fragments and Fusions 

B. l Variants 
B.2 Fragments 
B.3 Fusions 

The invention also includes (III) methods of modulating polypeptide production, 



IIIA. Suppression 

A.l Antisense 
A.2 Ribozymes 
A.3 Co-suppression 

A.4 Insertion of Sequences into the Gene to be Modulated 
A.5 Promoter Modulation 

A. 6 Expression of Genes containing Dominant-Negative Mutations 

IIIB. Enhanced Expression 

B. l Insertion of an Exogenous Gene 
B.2 Promoter Modulation 



such as 
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The invention further concerns (IV) gene constructs and vector construction, 



such as 

IVA. Coding Sequences 
IVB. Promoters 
IVC. Signal Peptides 

The invention still further relates to 
V Transformation Techniques 



Definitions 



Allelic variant An allelic variant" is an alternative form of the same SDF, 

which resides at the same chromosomal locus in the organism. Allelic variations can 
occur in any portion of the gene sequence, including regulatory regions. Allelic 
variants can arise by normal genetic variation in a population. Allelic variants can 
also be produced by genetic engineering methods. An allelic variant can be one that 
is found in a naturally occurring plant, including a cultivar or ecotype. An allelic 
variant may or may not give rise to a phenotypic change, and may or may not be 
expressed. An allele can result in a detectable change in the phenotype of the trait 
represented by the locus. A phenotypically silent allele can give rise to a product. 

Alternatively spliced messages Within the context of the current invention, 
alternatively spliced messages" refers to mature mRNAs originating from a single 
gene with variations in the number and/or identity of exons, introns and/or intron- 
exon junctions. 

Chimeric The term chimeric" is used to describe genes, as defined supra, or 
contructs wherein at least two of the elements of the gene or construct, such as the 
promoter and the coding sequence and/or other regulatory sequences and/or filler 
sequences and/or complements thereof, are heterologous to each other. 



^pi- 



Attorney No. 3flK-1243P 

13 



Constitutive Promoter: Promoters referred to herein as "constitutive promoters" actively 
promote transcription under most, but not necessarily all, environmental conditions and 
states of development or cell differentiation. Examples of constitutive promoters include 
the cauliflower mosaic virus (CaMV) 35S transcript initiation region and the V or 2' 
promoter derived from T-DNA of Agrobacterium tumefaciens, and other transcription 
initiation regions from various plant genes, such as the maize ubiquitin-1 promoter, 
known to those of skill. 

Coordinately Expressed: The term coordinately expressed," as used in the 
current invention, refers to genes that are expressed at the same or a similar time 
and/or stage and/or under the same or similar environmental conditions. 

Domain: Domains are fingerprints or signatures that can be used to 

characterize protein families and/or parts of proteins. Such fingerprints or signatures 
can comprise conserved (1) primary sequence, (2) secondary structure, and/or (3) 
three-dimensional conformation. Generally, each domain has been associated with 
either a family of proteins or motifs. Typically, these families and/or motifs have 
been correlated with specific in-vitro and/or in-vivo activities. A domain can be any 
length, including the entirety of the sequence of a protein. Detailed descriptions of 
the domains, associated families and motifs, and correlated activities of the 
polypeptides of the instant invention are described below. Usually, the polypeptides 
with designated domain(s) can exhibit at least one activity that is exhibited by any 
polypeptide that comprises the same domain(s). 



Endogenous The term endogenous," within the context of the current invention 
refers to any polynucleotide, polypeptide or protein sequence which is a natural part 
of a cell or organisms regenerated from said cell. 

Exogenous Exogenous," as referred to within, is any polynucleotide, 

polypeptide or protein sequence, whether chimeric or not, that is initially or 
subsequently introduced into the genome of an individual host cell or the organism 
regenerated from said host cell by any means other than by a sexual cross. Examples 
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of means by which this can be accomplished are described below, and include 
Agrobacterium-mediated transformation (of dicots - e.g. Salomon et al. EMBO J. 
3:141 (1984); Herrera-Estrella et al. EMBO J. 2:987 (1983); of monocots, 
representative papers are those by Escudero et al., Plant J. 10:355 (1996), Ishida et 
al., Nature Biotechnology 14:745 (1996), May et al., Bio/Technology 12:486 (1995)), 
biolistic methods (Armaleo et al., Current Genetics 17:97 1990)), electroporation, in 
planta techniques, and the like. Such a plant containing the exogenous nucleic acid is 
referred to here as a T 0 for the primary transgenic plant and T 2 for the first generation. 
The term exogenous" as used herein is also intended to encompass inserting a 
naturally found element into a non-naturally found location. 

Filler sequence: As used herein, filler sequence" refers to any nucleotide 
sequence that is inserted into DNA construct to evoke a particular spacing between 
particular components such as a promoter and a coding region and may provide an 
additional attribute such as a restriction enzyme site. 

Gene: The term gene," as used in the context of the current invention, encompasses 
all regulatory and coding sequence contiguously associated with a single hereditary 
unit with a genetic function (see SCHEMATIC 1). Genes can include non-coding 
sequences that modulate the genetic function that include, but are not limited to, those 
that specify polyadenylation, transcriptional regulation, DNA conformation, 
chromatin conformation, extent and position of base methylation and binding sites of 
proteins that control all of these. Genes comprised of exons" (coding sequences), 
which may be interrupted by introns" (non-coding sequences), encode proteins. A 
gene's genetic function may require only RNA expression or protein production, or 
may only require binding of proteins and/or nucleic acids without associated 
expression. In certain cases, genes adjacent to one another may share sequence in 
such a way that one gene will overlap the other. A gene can be found within the 
genome of an organism, artificial chromosome, plasmid, vector, etc., or as a separate 
isolated entity. 



Gene Family: Gene family" is used in the current invention to describe a group of 
functionally related genes, each of which encodes a separate protein. 
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Heterologous sequences: Heterologous sequences" are those that are not 
operatively linked or are not contiguous to each other in nature. For example, a promoter 
from corn is considered heterologous to anArabidopsis coding region sequence. Also, a 
promoter from a gene encoding a growth factor from corn is considered heterologous to 
a sequence encoding the com receptor for the growth factor. Regulatory element 
sequences, such as UTRs or 3' end termination sequences that do not originate in nature 
from the same gene as the coding sequence originates from, are considered heterologous 
to said coding sequence. Elements operatively linked in nature and contiguous to each 
other are not heterologous to each other. On the other hand, these same elements 
remain operatively linked but become heterologous if other filler sequence is placed 
between them. Thus, the promoter and coding sequences of a corn gene expressing ; 
amino acid transporter are not heterologous to each other, but the promoter and 
coding sequence of a corn gene operatively linked in a novel manner are 
heterologous. 



an 



Homologous gene In the current invention, homologous gene" refers to a gene 
that shares sequence similarity with the gene of interest. This similarity may be in only a 
fragment of the sequence and often represents a functional domain such as, examples 
including without limitation a DNA binding domain, a domain with tyrosine kinase 
activity, or the like. The functional activities of homologous genes are not necessarily the 
same. 



Inducible Promoter An inducible promoter" in the context of the current 

invention refers to a promoter which is regulated under certain conditions, such as light, 
chemical concentration, protein concentration, conditions in an organism, cell, or 
organelle, etc. A typical example of an inducible promoter, which can be utilized with 
the polynucleotides of the present invention, is PARSK1, the promoter from the 
Arabidopsis gene encoding a serine-threonine kinase enzyme, and which promoter is 
induced by dehydration, abscissic acid and sodium chloride (Wang and Goodman, Plant 
J. S:37 (1995)) Examples of environmental conditions that may affect transcription by 
inducible promoters include anaerobic conditions, elevated temperature, or the presence 
of light. 
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Intergenic region Intergenic region," as used in the current invention, refers to 
nucleotide sequence occurring in the genome that separates adjacent genes. 

Mutant gene In the current invention, mutant" refers to a heritable change in DNA 
sequence at a specific location. Mutants of the current invention may or may not have 
an associated identifiable function when the mutant gene is transcribed. 

Orthologous Gene In the current invention orthologous gene" refers to a second 
gene that encodes a gene product that performs a similar function as the product of a 
first gene. The orthologous gene may also have a degree of sequence similarity to the 
first gene. The orthologous gene may encode a polypeptide that exhibits a degree of 
sequence similarity to a polypeptide corresponding to a first gene. The sequence 
similarity can be found within a functional domain or along the entire length of the 
coding sequence of the genes and/or their corresponding polypeptides. 

Percentage of sequence identity "Percentage of sequence identity," as used herein, 
is determined by comparing two optimally aligned sequences over a comparison 
window, where the fragment of the polynucleotide or amino acid sequence in the 
comparison window may comprise additions or deletions (e.g., gaps or overhangs) as 
compared to the reference sequence (which does not comprise additions or deletions) for 
optimal alignment of the two sequences. The percentage is calculated by determining 
the number of positions at which the identical nucleic acid base or amino acid residue 
occurs in both sequences to yield the number of matched positions, dividing the number 
of matched positions by the total number of positions in the window of comparison and 
multiplying the result by 100 to yield the percentage of sequence identity. Optimal 
alignment of sequences for comparison may be conducted by the local homology 
algorithm of Smith and Waterman Add. APL. Math. 2:482 (1981), by the homology 
alignment algorithm of Needleman and Wunsch/. Mol. Biol. 4§:443 (1970), by the 
search for similarity method of Pearson and Lipman Proc. Natl. Acad. Sci. (USA) 85: 
2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, 
BLAST, PASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics 
Computer Group (GCG), 575 Science Dr., Madison, WI), or by inspection. Given that 
two sequences have been identified for comparison, GAP and BESTFIT are preferably 
employed to determine their optimal alignment. Typically, the default values of 5.00 for 



; or 
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gap weight and 0.30 for gap weight length are used. The term "substantial sequence 
identity" between polynucleotide or polypeptide sequences refers to polynucleotide < 
polypeptide comprising a sequence that has at least 80% sequence identity, preferably at 
least 85%, more preferably at least 90% and most preferably at least 95%, even more 
preferably, at least 96%, 97%, 98% or 99% sequence identity compared to a reference 
sequence using the programs. 

Plant Promoter A plant promoter" is a promoter capable of initiating 

transcription in plant cells and can drive or facilitate transcription of a fragment of the 
SDF of the instant invention or a coding sequence of the SDF of the instant invention. 
Such promoters need not be of plant origin. For example, promoters derived from plant 
viruses, such as the CaMV35S promoter or from Agrobacterium tumefaciens such as the 
T-DNA promoters, can be plant promoters. A typical example of a plant promoter of 
plant origin is the maize ubiquitin-1 (ubi-l)promoter known to those of skill. 



Promoter: The term "promoter," as used herein, refers to a region of 

sequence determinants located upstream from the start of transcription of a gene and 
which are involved in recognition and binding of RNA polymerase and other proteins to 
initiate and modulate transcription. A basal promoter is the minimal sequence necessary 
for assembly of a transcription complex required for transcription initiation. Basal 
promoters frequently include a TATA box" element usually located between 15 and 35 
nucleotides upstream from the site of initiation of transcription. Basal promoters also 
sometimes include a CCAAT box" element (typically a sequence CCAAT) and/or a 
GGGCG sequence, usually located between 40 and 200 nucleotides, preferably 60 to 
120 nucleotides, upstream from the start site of transcription. 

Public sequence: The term public sequence ," as used in the context of the 
instant application, refers to any sequence that has been deposited in a publicly 
accessible database. This term encompasses both amino acid and nucleotide 
sequences. Such sequences are publicly accessible, for example, on the BLAST 
databases on the NCBI FTP web site (accessible at ncbi.nlm.gov/blast). The database 
at the NCBI GTP site utilizes gi" numbers assigned by NCBI as a unique identifier 
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for each sequence in the databases, thereby providing a non-redundant database for 
sequence from various databases, including GenBank, EMBL, DBBJ, (DNA Database 
of Japan) and PDB (Brookhaven Protein Data Bank). 

Regulatory Sequence The term regulatory sequence," as used in the current 

invention, refers to any nucleotide sequence that influences transcription or translation 
initiation and rate, and stability and/or mobility of the transcript or polypeptide 
product. Regulatory sequences include, but are not limited to, promoters, promoter 
control elements, protein binding sequences, 5 ' and 3' UTRs, transcriptional start site, 
termination sequence, polyadenylation sequence, introns, certain sequences within a 
coding sequence, etc. 



Related Sequences: Related sequences" refer to either a polypeptide or a 
nucleotide sequence that exhibits some degree of sequence similarity with a sequence 
described by the REF Tables and the SEQ Tables. 

Scaffold Attachment Region (SAR) As used herein, scaffold attachment region" is a 
DNA sequence that anchors chromatin to the nuclear matrix or scaffold to generate 
loop domains that can have either a transcriptionally active or inactive structure 
(Spiker and Thompson (1996) Plant Physiol. 110: 15-21). 

Sequence-determined DNA fragments (SDFs) Sequence-determined DNA 

fragments" as used in the current invention are isolated sequences of genes, fragments 
of genes, intergenic regions or contiguous DNA from plant genomic DNA or cDNA 
or RNA the sequence of which has been determined. 

Signal Peptide A signal peptide" as used in the current invention is an amino 

acid sequence that targets the protein for secretion, for transport to an intracellular 
compartment or organelle or for incorporation into a membrane. Signal peptides are 
indicated in the tables and a more detailed description located below. 

Specific Promoter In the context of the current invention, specific promoters" 
refers to a subset of inducible promoters that have a high preference for being induced 
in a specific tissue or cell and/or at a specific time during development of an 
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organism. By high preference" is meant at least 3-fold, preferably 5-fold, more 
preferably at least 10-fold still more preferably at least 20-fold, 50-fold or 100-fold 
increase in transcription in the desired tissue over the transcription in any other 
tissue. Typical examples of temporal and/or tissue specific promoters of plant origin 
that can be used with the polynucleotides of the present invention, are: PTA29, a 
promoter which is capable of driving gene transcription specifically in tapetum and only 
during anther development (Koltonow et al., Plant Cell 2:1201 (1990); RCc2 and RCc3, 
promoters that direct root-specific gene transcription in rice (Xu et al., Plant Mol. Biol. 
22:237 (1995); TobRB27, a root-specific promoter from tobacco (Yamamoto et al., 
Plant Cell 3:371 (1991)). Examples of tissue-specific promoters under developmental 
control include promoters that initiate transcription only in certain tissues or organs, such 
as root, ovule, fruit, seeds, or flowers. Other suitable promoters include those from genes 
encoding storage proteins or the lipid body membrane protein, oleosin. A few root- 
specific promoters are noted above. 

Stringency "Stringency" as used herein is a function of probe length, probe 
composition (G + C content), and salt concentration, organic solvent concentration, and 
temperature of hybridization or wash conditions. Stringency is typically compared by the 
parameter T m , which is the temperature at which 50% of the complementary molecules 
in the hybridization are hybridized, in terms of a temperature differential from T m . High 
stringency conditions are those providing a condition of T m - 5°C to T m - 10°C. Medium 
or moderate stringency conditions are those providing T m - 20°C to T m - 29°C. Low 
stringency conditions are those providing a condition of T m - 40°C to T m - 48°C. The 
relationship of hybridization conditions to T m (in °C) is expressed in the mathematical 
equation 

T m = 81.5 -16.6(logio[Na + ]) + 0.41(%G+C) - (600/N) (1) 

where N is the length of the probe. This equation works well for probes 14 to 70 
nucleotides in length that are identical to the target sequence. The equation below for T m 
of DNA-DNA hybrids is useful for probes in the range of 50 to greater than 500 
nucleotides, and for conditions that include an organic solvent (formamide). 
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T m = 81.5+16.6 log {[Na + ]/(l+0.7[Na + ])}+ 0.41(%G+C)-500/L 0.63(%formamide) 
(2) 

where L is the length of the probe in the hybrid. (P. Tijessen, Hybridization with 
Nucleic Acid Probes" in Laboratory Techniques i n Biochemistry and Molecular 
Biology, P.C. vand der Vliet, ed., c. 1993 by Elsevier, Amsterdam.) The T m of 
equation (2) is affected by the nature of the hybrid; for DNA-RNA hybrids T m is 10- 
15°C higher than calculated, for RNA-RNA hybrids T m is 20-25°C higher. Because the 
T m decreases about 1 °C for each 1% decrease in homology when a long probe is used 
(Bonner et al., J. Mol Biol. 81:123 (1973)), stringency conditions can be adjusted to 
favor detection of identical genes or related family members. 

Equation (2) is derived assuming equilibrium and therefore, hybridizations 
according to the present invention are most preferably performed under conditions of 
probe excess and for sufficient time to achieve equilibrium. The time required to 
reach equilibrium can be shortened by inclusion of a hybridization accelerator such as 
dextran sulfate or another high volume polymer in the hybridization buffer. 

Stringency can be controlled during the hybridization reaction or after 
hybridization has occurred by altering the salt and temperature conditions of the wash 
solutions used. The formulas shown above are equally valid when used to compute 
the stringency of a wash solution. Preferred wash solution stringencies lie within the 
ranges stated above; high stringency is 5-8°C below T m , medium or moderate 
stringency is 26-29°C below T m and low stringency is 45-48°C below T m . 

Substantially free of A composition containing A is substantially free of B 

when at least 85% by weight of the total A+B in the composition is A. Preferably, A 
comprises at least about 90% by weight of the total of A+B in the composition, more 
preferably at least about 95% or even 99% by weight. For example, a plant gene or 
DNA sequence can be considered substantially free of other plant genes or DNA 
sequences. 



Translational start site In the context of the current invention, a translational 

start site" is usually an ATG in the cDNA transcript, more usually the first ATG. A 
single cDNA, however, may have multiple translational start sites. 
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Transcription start site Transcription start site" is used in the current invention 

to describe the point at which transcription is initiated. This point is typically located 
about 25 nucleotides downstream from a TFIID binding site, such as a TATA box. 
Transcription can initiate at one or more sites within the gene, and a single gene may 
have multiple transcriptional start sites, some of which may be specific for transcription 
in a particular cell-type or tissue. 

Untranslated region (UTR) A UTR" is any contiguous series of nucleotide bases 
that is transcribed, but is not translated. These untranslated regions may be associated 
with particular functions such as increasing mRNA message stability. Examples of 
UTRs include, but are not limited to polyadenylation signals, terminations sequences, 
sequences located between the transcriptional start site and the first exon (5' UTR) 
and sequences located between the last exon and the end of the mRNA (3' UTR). 

Variant: The term variant" is used herein to denote a polypeptide or protein or 
polynucleotide molecule that differs from others of its kind in some way. For 
example, polypeptide and protein variants can consist of changes in amino acid 
sequence and/or charge and/or post-translational modifications (such as glycosylation, 
etc). 



DETAILED DESCRIPTION OF THE INVENTION 

L Polynucleotides 

Exemplified SDFs of the invention represent fragments of the genome of corn, 
wheat, rice, soybean or Arabidopsis and/or represent mRNA expressed from that 
genome. The isolated nucleic acid of the invention also encompasses corresponding 
fragments of the genome and/or cDNA complement of other organisms as described in 
detail below. 

Polynucleotides of the invention can be isolated from polynucleotide libraries 
using primers comprising sequence similar to those described by the REF Tables and 
the SEQ Tables. See, for example, the methods described in Sambrook et al., supra. 
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Alternatively, the polynucleotides of the invention can be produced by 
chemical synthesis. Such synthesis methods are described below. 



It is contemplated that the nucleotide sequences presented herein may contain 
some small percentage of errors. These errors may arise in the normal course of 
determination of nucleotide sequences. Sequence errors can be corrected by obtaining 
seeds deposited under the accession numbers cited above, propagating them, isolating 
genomic DNA or appropriate mRNA from the resulting plants or seeds thereof, 
amplifying the relevant fragment of the genomic DNA or mRNA using primers having a 
sequence that flanks the erroneous sequence, and sequencing the amplification product. 

I.A. Probes. Primers and Substrates 

SDFs of the invention can be applied to substrates for use in array applications 
such as, but not limited to, assays of global gene expression, for example under 
varying conditions of development, growth conditions. The arrays can also be used in 
diagnostic or forensic methods (WO95/35505, US 5,445,943 and US 5,410,270). 

Probes and primers of the instant invention will hybridize to a polynucleotide 
comprising a sequence in the REF Tables and the SEQ Tables. Though many 
different nucleotide sequences can encode an amino acid sequence, the sequences of 
the REF Tables and the SEQ Tables are generally preferred for encoding polypeptides 
of the invention. However, the sequence of the probes and/or primers of the instant 
invention need not be identical to those in the REF Tables and the SEQ Tables or the 
complements thereof. For example, some variation in probe or primer sequence 
and/or length can allow additional family members to be detected, as well as 
orthologous genes and more taxonomically distant related sequences. Similarly, 
probes and/or primers of the invention can include additional nucleotides that serve as 
a label for detecting the formed duplex or for subsequent cloning purposes. 

Probe length will vary depending on the application. For use as primers, 
probes are 12-40 nucleotides, preferably 18-30 nucleotides long. For use in mapping, 
probes are preferably 50 to 500 nucleotides, preferably 100-250 nucleotides long. 
For Southern hybridizations, probes as long as several kilobases can be used as 
explained below. 
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The probes and/or primers can be produced by synthetic procedures such as 
the triester method of Matteucci et al. J. Am. Chem. Soc. 103 :3185( 1981); or 
according to Urdea et al. Proc. Natl. Acad. 80:7461 (1981) or using commercially 
available automated oligonucleotide synthesizers. 



LB. Methods of Detection and Isolation 

The polynucleotides of the invention can be utilized in a number of methods 
known to those skilled in the art as probes and/or primers to isolate and detect 
polynucleotides, including, without limitation: Southerns, Northerns, Branched DNA 
hybridization assays, polymerase chain reaction, and microarray assays, and 
variations thereof. Specific methods given by way of examples, and discussed below 
include: 

Hybridization 

Methods of Mapping 

Southern Blotting 

Isolating cDNA from Related Organisms 

Isolating and/or Identifying Orthologous Genes. 
Also, the nucleic acid molecules of the invention can used in other methods, such as 
high density oligonucleotide hybridizing assays, described, for example, in U.S. Pat. 
Nos. 6,004,753; 5,945,306; 5,945,287; 5,945,308; 5,919,686; 5,919,661; 5,919,627; 
5,874,248; 5,871,973; 5,871,971; and 5,871,930; and PCT Pub. Nos. WO 9946380; 
WO 9933981; WO 9933870; WO 9931252; WO 9915658; WO 9906572; WO 
9858052; WO 9958672; and WO 9810858. 

B. 1 . Hybridization 
The isolated SDFs of the REF Tables and the SEQ Tables of the present 
invention can be used as probes and/or primers for detection and/or isolation of 
related polynucleotide sequences through hybridization. Hybridization of one nucleic 
acid to another constitutes a physical property that defines the subject SDF of the 
invention and the identified related sequences. Also, such hybridization imposes 
structural limitations on the pair. A good general discussion of the factors for 
determining hybridization conditions is provided by Sambrook et al. ("Molecular 
Cloning, a Laboratory Manual, 2nd ed., c. 1989 by Cold Spring Harbor Laboratory 
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Press, Cold Spring Harbor, NY; see esp., chapters 11 and 12). Additional considerations 
and details of the physical chemistry of hybridization are provided by G.H. Keller and 
M.M. Manak DNA Probes", 2 nd Ed. pp. 1-25, c. 1993 by Stockton Press, New York, 
NY. 

Depending on the stringency of the conditions under which these probes and/or 
primers are used, polynucleotides exhibiting a wide range of similarity to those in the 
REF and SEQ Tables can be detected or isolated. When the practitioner wishes to 
examine the result of membrane hybridizations under a variety of stringencies, an 
efficient way to do so is to perform the hybridization under a low stringency 
condition, then to wash the hybridization membrane under increasingly stringent 
conditions. 

When using SDFs to identify orthologous genes in other species, the 
practitioner will preferably adjust the amount of target DNA of each species so that, 
as nearly as is practical, the same number of genome equivalents are present for each 
species examined. This prevents faint signals from species having large genomes, and 
thus small numbers of genome equivalents per mass of DNA, from erroneously being 
interpreted as absence of the corresponding gene in the genome. 

The probes and/or primers of the instant invention can also be used to detect or 
isolate nucleotides that are identical" to the probes or primers. Two nucleic acid 
sequences or polypeptides are said to be "identical" if the sequence of nucleotides or 
amino acid residues, respectively, in the two sequences is the same when aligned for 
maximum correspondence as described below. 

Isolated polynucleotides within the scope of the invention also include allelic 
variants of the specific sequences presented in the REF and SEQ Tables. The probes 
and/or primers of the invention can also be used to detect and/or isolate polynucleotides 
exhibiting at least 80% sequence identity with the sequences of the REF and SEQ 
Tables or fragments thereof. 

With respect to nucleotide sequences, degeneracy of the genetic code provides 
the possibility to substitute at least one base of the base sequence of a gene with a 
different base without causing the amino acid sequence of the polypeptide produced 
from the gene to be changed. Hence, the DNA of the present invention may also have 
any base sequence that has been changed from a sequence in the REF and SEQ Tables 



Attorney No. ^p)-1243P 

by substitution in accordance with degeneracy of genetic code. References describing 
codon usage include: Carels et al., 7. Mol. Evol 46: 45 (1998) and Fennoy et al., 
Nucl Acids Res. 21(23) : 5294 (1993). 

B.2. Mapping 

The isolated SDF DNA of the invention can be used to create various types of 
genetic and physical maps of the genome of corn, Arabidopsis, soybean, rice, wheat, 
or other plants. Some SDFs may be absolutely associated with particular phenotypic 
traits, allowing construction of gross genetic maps. While not all SDFs will 
immediately be associated with a phenotype, all SDFs can be used as probes for 
identifying polymorphisms associated with phenotypes of interest. Briefly, one 
method of mapping involves total DNA isolation from individuals. It is subsequently 
cleaved with one or more restriction enzymes, separated according to mass, 
transferred to a solid support, hybridized with SDF DNA and the pattern of fragments 
compared. Polymorphisms associated with a particular SDF are visualized as 
differences in the size of fragments produced between individual DNA samples after 
digestion with a particular restriction enzyme and hybridization with the SDF. After 
identification of polymorphic SDF sequences, linkage studies can be conducted. By 
using the individuals showing polymorphisms as parents in crossing programs, F2 
progeny recombinants or recombinant inbreds, for example, are then analyzed. The 
order of DNA polymorphisms along the chromosomes can be determined based on 
the frequency with which they are inherited together versus independently. The 
closer two polymorphisms are together in a chromosome the higher the probability 
that they are inherited together. Integration of the relative positions of all the 
polymorphisms and associated marker SDFs can produce a genetic map of the 
species, where the distances between markers reflect the recombination frequencies in 
that chromosome segment. 

The use of recombinant inbred lines for such genetic mapping is described for 
Arabidopsis by Alonso-Blanco et al. (Methods in Molecular Biology, vol.82, 
Arabidopsis Protocols", pp. 137-146, J.M. Martinez-Zapater and J. Salinas, eds., c. 
1998 by Humana Press, Totowa, NJ) and for corn by Burr ( Mapping Genes with 
Recombinant Inbreds", pp. 249-254. In Freeling, M. and V. Walbot (Ed.), The Maize 
Handbook, c. 1994 by Springer- Verlag New York, Inc.: New York, NY, USA; Berlin 
Germany; Burr et al. Genetics (1998) 118: 519; Gardiner, J. et al., (1993) Genetics 
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134: 917). This procedure, however, is not limited to plants and can be used for other 
organisms (such as yeast) or for individual cells. 

The SDFs of the present invention can also be used for simple sequence repeat 
(SSR) mapping. Rice SSR mapping is described by Morgante et al. (The Plant 
Journal (1993) 3: 165), Panaud et al. (Genome (1995) 38: 1170); Senior et aL (Crop 
Science (1996) 36: 1676), Taramino et al. (Genome (1996) 39: 277) and Ahn et al. 
(Molecular and General Genetics (1993) 241: 483-90). SSR mapping can be 
achieved using various methods. In one instance, polymorphisms are identified when 
sequence specific probes contained within an SDF flanking an SSR are made and 
used in polymerase chain reaction (PCR) assays with template DNA from two or 
more individuals of interest. Here, a change in the number of tandem repeats 
between the SSR-flanking sequences produces differently sized fragments (U.S. 
Patent 5,766,847). Alternatively, polymorphisms can be identified by using the PCR 
fragment produced from the SSR-flanking sequence specific primer reaction as a 
probe against Southern blots representing different individuals (U.H. Refseth et al., 
(1991) Electrophoresis 18: 1519). 

Genetic and physical maps of crop species have many uses. For example, 
these maps can be used to devise positional cloning strategies for isolating novel 
genes from the mapped crop species. In addition, because the genomes of closely 
related species are largely syntenic (that is, they display the same ordering of genes 
within the genome), these maps can be used to isolate novel alleles from relatives of 
crop species by positional cloning strategies. 

The various types of maps discussed above can be used with the SDFs of the 
invention to identify Quantitative Trait Loci (QTLs). Many important crop traits, 
such as the solids content of tomatoes, are quantitative traits and result from the 
combined interactions of several genes. These genes reside at different loci in the 
genome, oftentimes on different chromosomes, and generally exhibit multiple alleles 
at each locus. The SDFs of the invention can be used to identify QTLs and isolate 
specific alleles as described by de Vicente and Tanksley (Genetics 134:585 (1993)). 
In addition to isolating QTL alleles in present crop species, the SDFs of the invention 
can also be used to isolate alleles from the corresponding QTL of wild relatives. 
Transgenic plants having various combinations of QTL alleles can then be created and 
the effects of the combinations measured. Once a desired allele combination has been 
identified, crop improvement can be accomplished either through biotechnological 
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means or by directed conventional breeding programs (for review see Tanksley and 
McCouch, Science 277:1063 (1997)). 

In another embodiment, the SDFs can be used to help create physical maps of 
the genome of corn, Arabidopsis and related species. Where SDFs have been ordered 
on a genetic map, as described above, they can be used as probes to discover which 
clones in large libraries of plant DNA fragments in YACs, BACs, etc. contain the 
same SDF or similar sequences, thereby facilitating the assignment of the large DNA 
fragments to chromosomal positions. Subsequently, the large BACs, YACs, etc. can 
be ordered unambiguously by more detailed studies of their sequence composition 
(e.g. Marra et al. (1997) Genomic Research 7:1072-1084) and by using their end or 
other sequences to find the identical sequences in other cloned DNA fragments. The 
overlapping of DNA sequences in this way allows large contigs of plant sequences to 
be built that, when sufficiently extended, provide a complete physical map of a 
chromosome. Sometimes the SDFs themselves will provide the means of joining 
cloned sequences into a contig. 

The patent publication WO95/35505 and U.S. Patents 5,445,943 and 
5,410,270 describe scanning multiple alleles of a plurality of loci using hybridization 
to arrays of oligonucleotides. These techniques are useful for each of the types of 
mapping discussed above. 



SDFs of the present invention, any individual can be genotyped. These individual 
genotypes can be used for the identification of particular cultivars, varieties, lines, 
ecotypes and genetically modified plants or can serve as tools for subsequent genetic 
studies involving multiple phenotypic traits. 

B.3 Southern Blot Hybridization 

The sequences from the REF and SEQ Tables can be used as probes for 
various hybridization techniques. These techniques are useful for detecting target 
polynucleotides in a sample or for determining whether transgenic plants, seeds or 
host cells harbor a gene or sequence of interest and thus might be expected to exhibit 
a particular trait or phenotype. 

In addition, the SDFs from the invention can be used to isolate additional 
members of gene families from the same or different species and/or orthologous genes 
from the same or different species. This is accomplished by hybridizing an SDF to, 



Following the procedures described above and using a plurality of the 
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for example, a Southern blot containing the appropriate genomic DNA or cDNA. 
Given the resulting hybridization data, one of ordinary skill in the art could 
distinguish and isolate the correct DNA fragments by size, restriction sites, sequence 
and stated hybridization conditions from a gel or from a library. 

Identification and isolation of orthologous genes from closely related species 
and alleles within a species is particularly desirable because of their potential for crop 
improvement. Many important crop traits, such as the solid content of tomatoes, 
result from the combined interactions of the products of several genes residing at 
different loci in the genome. Generally, alleles at each of these loci can make 
quantitative differences to the trait. By identifying and isolating numerous alleles for 
each locus from within or different species, transgenic plants with various 
combinations of alleles can be created and the effects of the combinations measured. 
Once a more favorable allele combination has been identified, crop improvement can 
be accomplished either through biotechnological means or by directed conventional 
breeding programs (Tanksley et al. Science 277:1063(1997)). 

The results from hybridizations of the SDFs of the invention to, for example, 
Southern blots containing DNA from another species can also be used to generate 
restriction fragment maps for the corresponding genomic regions. These maps 
provide additional information about the relative positions of restriction sites within 
fragments, further distinguishing mapped DNA from the remainder of the genome. 

Physical maps can be made by digesting genomic DNA with different 
combinations of restriction enzymes. 

Probes for Southern blotting to distinguish individual restriction fragments can 
range in size from 15 to 20 nucleotides to several thousand nucleotides. More 
preferably, the probe is 100 to 1,000 nucleotides long for identifying members of a 
gene family when it is found that repetitive sequences would complicate the 
hybridization. For identifying an entire corresponding gene in another species, the 
probe is more preferably the length of the gene, typically 2,000 to 10,000 nucleotides, 
but probes 50-1,000 nucleotides long might be used. Some genes, however, might 
require probes up to 1,500 nucleotides long or overlapping probes constituting the 
full-length sequence to span their lengths. 

Also, while it is preferred that the probe be homogeneous with respect to its 
sequence, it is not necessary. For example, as described below, a probe representing 
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members of a gene family having diverse sequences can be generated using PCR to 
amplify genomic DNA or RNA templates using primers derived from SDFs that 
include sequences that define the gene family. 

For identifying corresponding genes in another species, the next most 
preferable probe is a cDNA spanning the entire coding sequence, which allows all of 
the mRNA-coding fragment of the gene to be identified. Probes for Southern blotting 
can easily be generated from SDFs by making primers having the sequence at the 
ends of the SDF and using corn or Arabidopsis genomic DNA as a template. In 
instances where the SDF includes sequence conserved among species, primers 
including the conserved sequence can be used for PCR with genomic DNA from a 
species of interest to obtain a probe. 

Similarly, if the SDF includes a domain of interest, that fragment of the SDF can be 
used to make primers and, with appropriate template DNA, used to make a probe to 
identify genes containing the domain. Alternatively, the PCR products can be 
resolved, for example by gel electrophoresis, and cloned and/or sequenced. Using 
Southern hybridization, the variants of the domain among members of a gene family, 
both within and across species, can be examined. 

B.4.1 Isolating DNA from Related Organisms 

The SDFs of the invention can be used to isolate the corresponding DNA from 
other organisms. Either cDNA or genomic DNA can be isolated. For isolating 
genomic DNA, a lambda, cosmid, BAC or YAC, or other large insert genomic library 
from the plant of interest can be constructed using standard molecular biology 
techniques as described in detail by Sambrook et al. 1989 (Molecular Cloning: A 
Laboratory Manual, 2 nd ed. Cold Spring Harbor Laboratory Press, New York) and by 
Ausubel et al. 1992 (Current Protocols in Molecular Biology, Greene Publishing, New 
York). 

To screen a phage library, for example, recombinant lambda clones are plated 
out on appropriate bacterial medium using an appropriate E. coli host strain. The 
resulting plaques are lifted from the plates using nylon or nitrocellulose filters. The 
plaque lifts are processed through denaturation, neutralization, and washing 
treatments following the standard protocols outlined by Ausubel et al. (1992). The 
plaque lifts are hybridized to either radioactively labeled or non-radioactively labeled 
SDF DNA at room temperature for about 16 hours, usually in the presence of 50% 
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formamide and 5X SSC (sodium chloride and sodium citrate) buffer and blocking 
reagents. The plaque lifts are then washed at 42°C with 1% Sodium Dodecyl Sulfate 
(SDS) and at a particular concentration of SSC. The SSC concentration used is 
dependent upon the stringency at which hybridization occurred in the initial Southern 
blot analysis performed. For example, if a fragment hybridized under medium 
stringency (e.g., Tm - 20 °C), then this condition is maintained or preferably adjusted 
to a less stringent condition (e.g., Tm-30°C) to wash the plaque lifts. Positive clones 
show detectable hybridization e.g., by exposure to X-ray films or chromogen 
formation. The positive clones are then subsequently isolated for purification using 
the same general protocol outlined above. Once the clone is purified, restriction 
analysis can be conducted to narrow the region corresponding to the gene of interest. 
The restriction analysis and succeeding subcloning steps can be done using 
procedures described by, for example Sambrook et al. (1989) cited above. 

The procedures outlined for the lambda library are essentially similar to those 
used for YAC library screening, except that the YAC clones are harbored in bacterial 
colonies. The YAC clones are plated out at reasonable density on nitrocellulose or 
nylon filters supported by appropriate bacterial medium in petri plates. Following the 
growth of the bacterial clones, the filters are processed through the denaturation, 
neutralization, and washing steps following the procedures of Ausubel et al. 1992. 
The same hybridization procedures for lambda library screening are followed. 

To isolate cDNA, similar procedures using appropriately modified vectors are 
employed. For instance, the library can be constructed in a lambda vector appropriate 
for cloning cDNA such as A-gtll. Alternatively, the cDNA library can be made in a 
plasmid vector. cDNA for cloning can be prepared by any of the methods known in 
the art, but is preferably prepared as described above. Preferably, a cDNA library will 
include a high proportion of full-length clones. 

B. 5. Isolating and/or Identifying Orthologous Genes 
Probes and primers of the invention can be used to identify and/or isolate 
polynucleotides related to those in the REF and SEQ Tables. Related polynucleotides 
are those that are native to other plant organisms and exhibit either similar sequence or 
encode polypeptides with similar biological activity. One specific example is an 
orthologous gene. Orthologous genes have the same functional activity. As such, 
orthologous genes may be distinguished from homologous genes. The percentage of 
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identity is a function of evolutionary separation and, in closely related species, the 
percentage of identity can be 98 to 100%. The amino acid sequence of a protein encoded 
by an orthologous gene can be less than 75% identical, but tends to be at least75% or at 
least 80% identical, more preferably at least 90%, most preferably at least 95% identical 
to the amino acid sequence of the reference protein. 

To find orthologous genes, the probes are hybridized to nucleic acids from a species of 
interest under low stringency conditions, preferably one where sequences containing as 
much as 40-45% mismatches will be able to hybridize. This condition is established by 
T m - 40°C to Tm - 48°C (see below). Blots are then washed under conditions of 
increasing stringency. It is preferable that the wash stringency be such that sequences 
that are 85 to 100% identical will hybridize. More preferably, sequences 90 to 100% 
identical will hybridize and most preferably only sequences greater than 95% identical 
will hybridize. One of ordinary skill in the art will recognize that, due to degeneracy in 
the genetic code, amino acid sequences that are identical can be encoded by DNA 
sequences as little as 67% identical or less. Thus, it is preferable, for example, to make 
an overlapping series of shorter probes, on the order of 24 to 45 nucleotides, and 
individually hybridize them to the same arrayed library to avoid the problem of 
degeneracy introducing large numbers of mismatches. 

As evolutionary divergence increases, genome sequences also tend to diverge. 
Thus, one of skill will recognize that searches for orthologous genes between more 
divergent species will require the use of lower stringency conditions compared to 
searches between closely related species. Also, degeneracy of the genetic code is 
more of a problem for searches in the genome of a species more distant evolutionarily 
from the species that is the source of the SDF probe sequences. 

Therefore the method described in Bouckaert et al., U.S. Ser. No. 60/121,700 
Atty. Dkt. No. 2750-1 17P, Client Dkt. No. 00010.001, filed February 25, 1999, 
hereby incorporated in its entirety by reference, can be applied to the SDFs of the 
present invention to isolate related genes from plant species which do not hybridize to 
the corn Arabidopsis, soybean, rice, wheat, and other plant sequences of the REF and 
SEQ Tables. 

Identification of the relationship of nucleotide or amino acid sequences among 
plant species can be done by comparing the nucleotide or amino acid sequences of 
SDFs of the present application with nucleotide or amino acid sequences of other 
SDFs such as those present in applications listed in the table below: 
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United States 


2750-0501 P 


80135.001 ! 


7/23/99 60/145,224 


United States 


2750-0506P 
2750-0509P 


00070.001 I 
00071 .001 


7/26/99 60/145,276 
7/27/99 60/145,913 








7/27/99 60/145,918 




2750-0508P 


80136.002 i 


7/27/99 60/145,919 


United States 


2750-051 OP 


00072.001 : 
80137.001 ! 


7/28/99 60/145,951 
8/2/99 60/146,388 








8/2/99 60/146,389 










United States 


2750-051 4P 
2750-051 7P 


00074.001 I 

80138.002 i 


8/3/99 60/147,038 
8/4/99 60/147,302 


United States 


2750-051 5P 


00076.001 j 
00077.001 \ 


8/4/99 60/147,204 
8/5/99 60/147,260 


I United States 


2750-051 9P 


80136.003 ! 
00079.001 ! 


8/5/99 60/147,192 
8/6/99 60/147,416 
















8/9/99 60/147,493 


United States 


2750-0523P 
2750-0522P 


80139.002 ! 
80139.001 i 


8/9/99 60/147,935 
8/10/99 60/148,171 


i United States 


2750-0524P 


00081 .001 ! 


8/11/99 60/148,319 


United States 

Unitpri StatP^ 
United States 


2750-0530P 
2750-0528 P 

Cm. 1 \J\J WW^WI 

2750-0527P 


00082.001 I 
80141.004 i 
80141.003 I 

80141 .002 \ 


8/12/99 60/148,341 
8/12/99 60/148,337 
8/12/99 60/148,340 
8/12/99 60/148,342 
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United States 


2750-0532P 


00083 001 ^ 
80142.002 


8/13/99 60/148,565 
8/13/99 60/148,684 










United States 


2750-0534P 
2750-0535P 


80001 .003 
80001 .004 


8/17/99 60/149,928 
8/17/99 60/149,926 














00084.001 


8/17/99 60/149,175 


United States 


2750-0538P 


00085.001 ! 

80143.002 \ 


8/18/99 60/149,426 
8/20/99 60/149,929 


















United States 


2750-0540P 
2750-0543 P 


80143.001 j 
00088.001 I 


8/23/99 60/149,930 
8/23/99 60/149,902 










United States 
LJnitpd States 


2750-0547P 
2750 0545 P 


00090.001 
80144.001 
00091 .001 


8/26/99 60/150,884 
8/27/99 60/151,065 
8/27/99 60/151,080 








8/27/99 60/151,066 






00092.001 


8/30/99 60/151,303 


United States 


2750-0552P 


00093.001 
00094.001 


8/31/99 60/151,438 
9/1/99 60/151,930 








9/3/99 09/391,631 


International 
United States 


2750-0551 F(PC) 
2750-0554P 


80001.100 
00095.001 
00096.001 


9/3/99 99/204,38 
9/7/99 60/152,363 
9/10/99 60/153,070 
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iUnited States 
United States 
United States 


2750-0556P 
2750-0557P 
2750-0558P 


00098.001 ! 
00099.001 i 
00101.001 : 


9/13/99 60/153,758 
9/15/99 60/154,018 
9/16/99 60/154,039 


United States 
United States 


2750-0559P 
2750-0560P 


00102.001 
00103.001 I 


9/20/99 60/154,779 
9/22/99 60/155,139 


United States 
United States 


2750-0561 P 
2750-0562P 


00104.001 
00105.001 ! 


9/23/99 60/155,486 
9/24/99 60/155,659 


United States 


2750-0563P 


00106.001 ! 


9/28/99 60/156,458 


United States 


2750-0564P 


00107.001 : 


9/29/99 60/156,596 


\ United States 
United States 


2750-0570P 
2750-0565P 


00108.001 
80010.002 ; 


10/4/99 60/157,117 
10/5/99 09/413,198 










International 


2750-0569F(PC) 


80010.102 i 


10/5/99 99/228,53 


International 
United States 


2750-0568F(PC) 
2750-0566P 


80010.101 
80010.003 : 


10/5/99 99/228,54 
10/5/99 09/412,922 


: International 


2750-0567F(PC) 


80010.100 i 


10/5/99 99/228,55 


! United States 


2750-0572P 


00110.001 ! 


10/6/99 60/157,865 


United States 


2750-0575P 
2750-0576P 


00111.001 ; 
00112.001 


10/7/99 60/158,029 
10/8/99 60/158,232 


United States 


2750-0577P 


00113.001 


10/12/99 60/158,369 








10/13/99 60/159,295 


(United States 
United States 


2750-0583P 
2750-0579P 


80148.002 
80146.002 


10/13/99 60/159,294 
10/13/99 60/159,293 




2750-0581 P 


80147.002 


10/14/99 60/159,637 


United States 


2750-0582P 


80148.001 


10/14/99 60/159,329 


\ United States 


2750-0578 P 


80146.001 


10/14/99 60/159,331 


United States 


2750-0573P 
2750-0584P 


80145.001 
00116.001 


10/14/99 60/159,330 
10/18/99 60/159,584 


United States 
United States 


2750-0585P 
2750-0590P 


00118.001 
80150.002 


10/21/99 60/160,815 
10/21/99 60/160,767 




2750-0589P 


80150.001 


10/21/99 60/160,768 


United States 


2750-0588P 


00119.001 


10/21/99 60/160,741 


; United States 
United States 


2750-0586P 
2750-0587P 


80149.001 
80149.002 


10/21/99 60/160,814 
10/21/99 60/160,770 






00120.001 


10/22/99 60/160.980 


United States 


2750-0592P 


80151.001 


10/22/99 60/160,989 


United States 
United States 


2750-0593P 
2750-0595 P 


80151.002 
80152.001 


10/22/99 60/160,981 
10/25/99 60/161,406 




2750-0596P 


80152.002 


10/25/99 60/161,404 










United States 


2750-0599P 


80153.002 
00122.001 


10/26/99 60/161,359 
I 10/26/99 60/161,361 








! 10/26/99 60/161.360 




2750-0601 P 


00123.001 


10/28/99 '60/1 61 ,920 


United States 
United States 


2750-0603P 
2750-0600P 


80154.002 
80026.002 


i 10/28/99 60/161,993 
I 10/28/99 09/428,944 




2750-0602P 


80154.001 


I 10/28/99 60/161,992 


United States 


2750-0604P 


00124.001 


! 10/29/99 60/162,143 


United States 
United States 


2750-0605P 
2750-0606P 


80155.001 
80155.002 


10/29/99 60/162,142 
! 10/29/99 60/162,228 
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United States 
United States 


2750-0609P 
2750 0608P 
2750 0607 P 


80156.002 | 

80156.001 \ 

00125.001 I 


11/1/99 60/162,895 
11/1/99 60/162,891 
11/1/99 60/162,894 


United States 
United States 
United States 


2750-0612P 
2750-061 1P 
2750 0610P 
2750-061 3P 


80157.002 j 
80157.001 
00126.001 
00127.001 


11/2/99 60/163,091 
11/2/99 60/163,092 
11/2/99 60/163,093 
11/3/99 60/163,249 






80158.001 


11/3/99 60/163,248 




2750-061 5P 


80158.002 


11/3/99 60/163,281 


LJnitpd States 
Unitpd Statp<3 


2750-061 7P 
2750-061 8P 


80159.001 
80159.002 


11/4/99 60/163,381 
11/4/99 60/163,380 










IJnitpH Rtatp<5 


2750-061 9P 


00129.001 


11/8/99 60/164,146 


United States 


2750-0620P 
2750-0621 P 


80160.001 
80160.002 


11/8/99 60/164,151 
11/8/99 60/164,150 






80161 .002 


11/9/99 60/164,260 






80162.002 


11/9/99 60/164,259 


i United States 


2750-0630P 


80164.002 
80164.001 


11/10/99 60/164,548 
11/10/99 60/164,545 











United States 2750-0627P 80163.002 \ 11/10/99 60/164,318 

United States 2750-0626P 80163.001 \ 11/10/99 60/164,321 

United States 2750-0622P 80161.001 i 11/10/99 60/164,319 

United States 2750-0624P 80162.001 I 11/10/99 60/164,317 

United States 2750-0634P 00133.001 \ 11/12/99 60/164,870 

United States 2750-0636P 80166.002 \ 11/12/99 60/164,962 

United States 2750-0635P 80166.001 ! 1 1/12/99 60/164,959 

United States 2750-0631 P 00132.001 j 11/12/99 60/164,961 

United States 2750-0632P 80165.001 i 1 1/12/99 60/164,871 

United States 2750-0633P 80165.002 ! 11/12/99 60/164,960 

United States 2750-0639P 80167.002 ! 11/15/99 60/164,926 

United States 2750-0637P 00134.001 i 11/15/99 60/164,927 

United States 2750-0638P 80167.001 j 11/15/99 60/164,929 

United States 2750-0640P 00135.001 i 11/16/99 60/165,669 

United States 2750-0641 P 80168.001 i 11/16/99 60/165,671 

United States 2750-0642P 80168.002 i 11/16/99 60/165,661 

United States 2750-0645P 80169.002 i 11/17/99 60/165,911 

United States 2750-0643P 00136.001 ! 11/17/99 60/165,919 

United States 2750-0644P 80169.001 j 11/17/99 60/165,918 

United States 2750-0648P 80170.002 i 1 1/18/99 60/166,158 

United States 2750-0646P 00137.001 j 11/18/99 60/166,157 

United States 2750-0647P 80170.001 ! 11/18/99 60/166,173 

United States 2750-0649P 00139.001 11/19/99 60/166,419 

United States 2750-0650P 80171.001 \ 11/19/99 60/166,411 

United States 2750-0651 P 80171.002 \ 11/19/99 60/166,412 

United States 2750-0652P 00140.001 j 11/22/99 60/166,733 

United States 2750-0653P 80172.001 \ 11/22/99 60/166,750 

United States 2750-0655P 80173.002 ; 11/23/99 60/167,362 

United States 2750-0656P 00141.001 i 11/24/99 60/167,233 

United States 2750-0657P 80174.001 \ 11/24/99 60/167,234 

United States 2750-0654P 80173.001 j 1 1/24/99 60/167,382 
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United States 
Unitpd Statp<5 

United States 


2750-0658P 
2750-0659P 
2750-0660P 


80174.002 • 
00142 001 
80175.001 : 


1 1 /24/99 60/1 67.235 
1 1 /30/99 60/1 67 Q04 

11/30/99 60/167,908 




12750-0661 P 


80175.002 


11/30/99 60/167,902 


United States 

: Wl lllvU VlCllvu 


2750-0662P 


80042 002 ^ 


12/1/99 09/451 320 


LJnitpri State*; 
United States 


2750-0664P 
2750-0663P 


80176 001 
00143.001 1 


1 2/1 /QQ 60/1 6R 23R 
12/1/99 60/1 68,232 










\ United States 


2750-0668P 


80177.002 : 


12/2/99 60/168 548 


United States 
: United States 


2750-0667P 
2750-0666P 


80177 001 : 

: III m V/V/ 1 

00144.001 I 


1 2/2/99 60/1 68 549 
12/2/99 60/168,546 


United States 


2750-0670P 


80178.001 I 


12/3/99 60/168,673 


United States 


2750-0669P 


00145 001 ^ 


1 2/3/99 60/1 68 675 


United States 
United States 


2750-0671 P 
2750-0673P 


80178 002 i 
80179.001 : 


12/3/99 60/168 674 
12/7/99 60/169,278 


United States 


2750-0674P 


80179.002 \ 


12/7/99 60/169,302 


United States 


2750-0672P 


00147.001 ! 


12/7/99 60/169,298 


United States 


2750-0675P 
£. t i or 


80180.001 ! 


12/8/99 60/169,692 


United States 
United States 


2750-0678P 
2750-0679P 


80181.001 ! 
80181 002 ' 


12/16/99 60/171.114 
1 2/1 6/99 60/1 71 098 


United States 
United States 


2750-0677P 
2750-0683 P 


00149.001 i 

80060.002 5 


12/16/99 60/171,107 
1/4/00 09/478,081 


United States 


2750-0684P 


80070.002 ! 


1/7/00 09/479,221 


International 


2750-0686RPQ 


80070 100 


1/7/00 00/004 66 


1 II Lv7U OICIIC70 

United States 


p7^n-nfiRRP 
2750-0681 P 


ani R4 nnp ^ 
80182 002 i 


1 /1 Q/nn fin/1 7fi qi n 
1 /1 9/00 60/1 76 866 
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001 *S2 0f)1 

WW 1 \JC- .\J\J 1 


1/2K/00 R0/17R 1RR 

1 / £-\JI \J\J \ \J\Jf 1 # 0 , 1 \J\J 


United States 
United States 


2750-0690P 
2750-0687P 


100153.001 \ 
80184.001 i 


1/27/00 60/178,547 
1/27/00 60/178,545 


United States 


2750-0682P 


80183.001 ! 


1/27/00 60/178,546 


Unitpd Statp<? 


2750-0680P 


80182 001 


1 /27/00 60/1 78 544 

1 / ^— t 1 \J\J : \J\Jf 1 1 \J , W I I 


United States 
United States 


2750-0691 P 
2750-0692 P 


80185 001 ^ 
00155.001 \ 


1/27/00 60/177 666 
1/28/00 60/178,754 


United States 


2750-0693P 


180186.001 


1/28/00 60/178,755 


United States 


2750-0695 P 


00157 001 


2/1/00 60/179,395 


United States 


P7^o-ofiQfiP 
2750-0694P 


OU I O / .\J\J 1 

80084.002 ! 


2/1 /OO 60/17Q ^RR 

CLf 1 f\J\J .\J\Jf 1 / v7,OOU 

2/3/00 09/497,191 




2750-0697P 


00158.001 ! 


2/3/00 60/180.039 


^United States 


2750-0698P 


80188 001 


2/3/00 60/180 139 


United States 
United States 


2750-0699P 
2750-0700P 


00159 001 ; 
80189.001 ! 


2/4/00 60/1 80 206 
2/4/00 60/180,207 


United States 


2750-0701 P 


00160.001 ! 


2/7/00 60/180.695 


i United States 


2750-0702P 


80190.001 I 


2/7/00 60/180,696 


United States 
United States 


2750-0703P 
2750-0704P 


100161.001 : 
80191.001 \ 


2/9/00 60/181,228 
2/9/00 60/181 ,214 


United States 


2750-0705P 


00162.001 I 


2/10/00 60/181,476 


United States 


2750-0706P 


80192.001 


2/10/00 60/181,551 


i United States 
United States 


2750-071 3P 
2750-0707P 


80194.001 \ 
00163.001 ! 


2/15/00 60/182,478 
2/15/00 60/182,477 
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United States 


2750-0708P 


80193.001 ; 


2/15/00 60/182,516 


United States 
United States 


2750-071 2P 
2750-071 4P 


00164.001 I 
00165.001 i 


2/1 5/00 60/182,512 
2/17/00 60/183,166 


United States 


2750-071 5P 


80195.001 


2/17/00 60/183.165 


United States 


2750-071 6P 


00167.001 \ 


2/24/00 60/184,667 


United States 
Eurooe 


2750-071 7P 
2750 0709F(EP) 


80196.001 \ 
80090 103 


2/24/00 60/184,658 
2/25/00 00/301 439 


United States 


2750-0709P 


80090.002 I 


2/25/00 09/513.996 


I United States 


2750-071 9P 


00168.001 


2/25/00 60/185,118 


United States 

: 111 IwU ItilV/ W 

United States 


2750-071 8P 
2750-0720P 


91022 001 : 

. W 1 \J • WW 1 

80197.001 \ 


2/25/00 60/1 85 1 40 

^— / t^wy ww i i ww f i i 

2/25/00 60/185,119 


Canada 


!2750-0709F(CA) 


80090.102 \ 


2/25/00 23/006.92 


Mexico 


!2750-0709F(MX) 


80090.101 i 


2/25/00 00/001,973 


United States 
United States 


2750-0722P 
2750-0721 P 


00169 001 

.WW 1 Ww»WW 1 

91023.001 ! 


2/28/00 60/1 85 396 

£•/ £vW/ WW \J\J 1 1 WW j WW\^ 

2/28/00 60/185,398 




2750-0723P 


80198.001 I 


2/28/00 60/185.397 


United States 


2750-0724P 

C— / WW \y f 1 


91024 001 : 

w 1 Wfc»~.WW 1 


2/29/00 60/1 85 750 

fc/ fc- w/ WW WW/ 1 ww f » w W 


United States 

1 Init^rl QfotpiQ 


2750-071 OP 


80100.002 ; 
noi yn nm 


3/1/00 09/517,537 
3/1 /00 


United States 


2750-0726P 


80199.001 ] 


3/1/00 60/186,296 


United States 
United States 


2750-071 1 P 
2750-0728P 


00171.001 : 
80200.001 : 


3/2/00 60/186,390 
3/2/00 60/187,178 


! United States 


2750-0729P 


00172.001 \ 


3/2/00 60/186.386 


United States 


2750-0730P 


80201 .001 : 


3/2/00 60/186,387 


United States 
1 United States 


2750-0731 P 
2750-0732P 

1 WW w# w£- 1 


91026.001 \ 
00173 001 : 

WW 1 I WaWW 1 


3/3/00 60/186,670 
3/3/00 60/1 86 748 

\J 1 w/ WW WW/ 1 WW f f i w 


: United States 

United States 


2750-0733P 
2750-6735P 


80202.001 I 
91027 001 


3/3/00 60/186,669 
3/7/00 60/1 87 379 


United States 
' United States 


2750-0734P 
2750-0736P 


00174.001 : 
00175.001 I 


3/7/00 60/187,378 
3/8/00 60/187,896 


United States 


2750-0737P 


80203.001 I 


3/8/00 60/187,888 


United States 


2750-0738P 


91028.001 : 


3/9/00 60/187,985 


United States 
United States 


2750-0739P 

£— 1 ww w# WW 1 

2750-0742P 


00177 001 : 

: W W ill .WW 1 

00178.001 I 


3/10/00 60/188 187 

W/ 1 W/ WW . WW/ 1 Ww, 1 W 1 

3/10/00 60/188,185 




2750-0740P 


80204.001 \ 


3/10/00 60/188,186 


United States 


2750-0743P 


80205.001 \ 


3/10/00 60/188,175 


United States 
United States 


2750-0741 P 

; t— 1 WW V# / <T 1 | 

2750-0744P 


91030 001 

iW 1 WWW.WW 1 

91031.001 I 


3/10/00 60/188 174 
3/13/00 60/188,687 




2750-0745P 


00179.001 I 


3/14/00 60/189.080 


^ United States 


2750-0746P 


80206.001 ; 


3/14/00 60/189.052 


United States 
United States 


2750-0747P 
2750-0748P 


91032.001 j 
00180.001 : 


3/15/00 60/189,460 
3/15/00 60/189,461 


United States 


2750-0749P 


80207.001 ! 


3/15/00 60/189,462 


United States 


2750-0753P 


8021 1 .001 \ 


3/16/00 60/190,121 


' United States 
United States 


i 2750-0755 P 

Cm. I WW w / wwl 

2750-0751 P 


00181 001 

:WW 1 W ■ .WW 1 

80209.001 I 


3/1 6/00 60/1 89 953 

W/ 1 W/ WW WW/ 1 Ww j WWW 

3/16/00 60/189,947 


United States 


2750-0757P 


91034.001 ; 


3/16/00 60/189.965 


United States 


2750-0756P 


80212.001 \ 


3/16/00 60/189.959 


United States 
United States 


2750-0750P 
2750-0754P 


80208.001 
91033.001 I 


3/16/00 60/190.120 
3/16/00 60/189,958 
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1 JnitpH States 

United States 
United States 


2750-0752P 
2750-0762P 
2750-0761 P 


80210.001 
80214.001 j 
00183.001 ! 


3/16/00 60/189,948 
3/20/00 60/190,089 
3/20/00 60/190,545 


United States 


2750-0760P 
2750-0759P 


91035.001 \ 
80213.001 : 


3/20/00 60/190,060 
3/20/00 60/190,070 


United States 


2750-0758P 


00182.001 ! 


3/20/00 60/190,069 
3/22/00 60/191,084 


United States 


2750-0764P 
2750-0767P 


80215.001 L 
80216.001 ! 


3/22/00 60/191,097 
3/23/00 60/191,545 


United States 
United States 


2750-0766P 
2750-0765P 


00185.001 • 
91036.001 : 


3/23/00 60/191,543 
3/23/00 60/191,549 








3/24/00 60/191,825 








3/24/00 60/191,826 


United States 


2750-0769P 
2750-0772P 


00186.001 ; 
00187.001 ! 


3/24/00 60/191,823 
3/27/00 60/192,421 








3/27/00 60/192,308 








3/27/00 60/192,420 
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United States 


2750-1 046P 


80284.001 ! 


7/11/00 60/217,384 


United States 


2750-1 044P 


91081.001 : 


7/11/00 60/217,385 


United States 


2750-1 052P 


80287.002 ; 


7/12/00 06/218,548 


United States 


2750-1 050P 


80286.002 ! 


7/12/00 60/217,754 


\ Mexico 


l2750-0985F(MX) 


00059.102 I 


7/12/00 


Europe 


;2750-0985F(EP) 


00059.101 ! 


7/12/00 


Canada 


2750-0985F(CA) 


00059.100 : 


7/12/00 


United States 


2750-0985P 


00059.002 ; 


7/12/00 09/615.007 


United States 


2750-0986P 


00060.002 ! 


7/13/00 09/615,748 


United States 


2750-1 054P 


80288.002 ! 


7/13/00 60/217,846 


! Mexico 


2750-0986F(MX) 


00060.102 i 


7/13/00 


Canada 


!2750-0986F(CA) 


00060.100 ! 


7/13/00 


Europe 


!2750-0986F(EP) 


00060.101 i 


7/13/00 


i United States 


2750-0987P 


00061 .002 \ 


7/14/00 09/617,525 


United States 


2750-1061 P 


80134.018 I 


7/14/00 09/614,450 


Canada 


2750-0987F(CA) 


00061.100 ! 


7/14/00 


Europe 


:2750-0987F(EP) 


00061.101 : 


7/14/00 


Mexico 


I2750-0987F(MX) 


00061.102 ! 


7/14/00 


United States 


2750-0988P 


00062.002 I 


7/14/00 09/617,203 


Canada 


2750-0988F(CA) 


00062.100 


7/14/00 




I2750-0988F(EP) 


00062.101 ! 


7/14/00 


Mexico 


2750-0988F(MX) 


00062.102 \ 


7/14/00 


United States 


2750-1 048P 


80285.002 I 


7/14/00 60/218,566 




2750-1 060P 


80134.017 \ 


7/14/00 09/614,388 








7/18/00 60/219,033 


United States 


2750-1 055P 


91082.001 


7/18/00 60/219,021 


United States 


2750-1 056P 


00254.001 : 


7/18/00 60/219,004 


Canada 


2750-0989F(CA) 


00064.100 : 


7/19/00 


United States 


2750-1 063P 


80134.022 ! 


7/19/00 09/617,682 


United States 


2750-1 064P 


80134.024 \ 


7/19/00 09/617,681 


United States 


2750-0989P 


00064.002 ; 


7/19/00 09/620,421 


Europe 


2750-0989F(EP) 


00064.101 ! 


7/19/00 


! Mexico 


:2750-0989F(MX) 


00064.102 ; 


7/19/00 


United States 


2750-1 062P 


80134.020 \ 


7/19/00 09/617,683 


Europe 


2750-0990F(EP) 


00065.101 : 


7/20/00 


\ Mexico 


|2750-0990F(MX) 


00065.102 : 


7/20/00 


; United States 


2750-1 065P 


80134.026 ! 


7/20/00 09/620.998 


United States 


2750-0990P 


00065.002 \ 


7/20/00 09/621,323 


United States 


2750-1 066P 


80135.004 \ 


7/20/00 09/620,978 




;2750-0990F(CA) 


00065.100 \ 


7/20/00 


United Statp<5 


2750-1 072P 


80135.003 i 


7/2 1 /00 09/62 1.900 


j United States 


2750-1 069P 


80134.023 ! 


7/21/00 09/620,313 


Mexico 


2750-0992F(MX) 


00067.102 \ 


7/21/00 


Europe 


!2750-0992F(EP) 


00067.101 ! 


7/21/00 




2750-0992P 


00067.002 i 


7/21/00 09/621,660 


United States 


2750-1 073P 


80134.025 ! 


7/21/00 09/620,314 










Mexico 


2750-0991 F(MX) 


00066.102 \ 


7/21/00 


Europe 


2750-0991 F(EP) 


00066.101 ; 


7/21/00 


; Canada 


2750-0991 F(CA) 


00066.100 i 


7/21/00 


United States 


2750-0991 P 


00066.002 ! 


7/21/00 09/621,630 
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United States 2750-1 068P 80134.016 ! 7/21/00 09/620,393 

United States 2750-0993P 00069.002 \ 7/21/00 09/621,902 

Canada 2750-0992F(CA) 00067.100 i 7/21/00 

EuroDe 2750-0993 F(EP) 00069.101 • 7/21/00 

Mexico 2750-0993F(MX) 00069.102 j 7/21/00 

United States 2750-1 070P 80134.019 \ 7/21/00 09/620,111 

United States 2750-1 067P 80134.015 \ 7/21/00 09/620,394 

Canada 2750-0993F(CA) 00069.100 I 7/21/00 

United States 2750-1 058P 91083.001 ! 7/25/00 60/220,814 

United States 2750-1 081 P 80293.001 ! 7/25/00 60/220,652 

United States 2750-1 080P 00256.001 I 7/25/00 



United States 


2750-1 079P 


80292.001 


7/25/00 60/220,811 


United States 


2750-1 059P 


00255.001 i 


7/25/00 60/220,467 


: IVICAJV^Vy 




00070 102 ^ 


7/26/00 


Europe 


2750-0994F(EP) 


00070.101 ! 


7/26/00 


Canada 


i2750-0994F(CA) 


00070.100 \ 


7/26/00 


United States 


2750-0994P 


00070.002 \ 


7/26/00 09/616,628 


j Mexico 


2750-0995F(MX) 


00071.102 : 


7/27/00 


Europe 


!2750-0995F(EP) 


00071.101 \ 


7/27/00 


Canada 


2750-0995F(CA) 


00071.100 : 


7/27/00 


United States 


2750-0995P 


00071 .002 ; 


7/27/00 09/628,986 


United States 


2750-1 075P 


80136.005 \ 


7/27/00 09/628,987 


United States 


2750-1 074P 


80136.004 ! 


7/27/00 09/628,984 


Europe 


2750-0996F(EP) 


00072.101 : 


7/28/00 


Canada 


2750-0996F(CA) 


00072.100 \ 


7/28/00 


\ Mexico 


2750-0996F(MX) 


00072.102 : 


7/28/00 


United States 


2750-0996P 


00072.002 ! 


7/28/00 09/628,552 










United States 


2750-1 049P 


80286.001 ! 


8/1/00 


United States 


2750-1 077P 


80137.004 \ 


8/2/00 09/630,442 










United States 


2750-1 076P 


80137.003 \ 


8/2/00 09/628,985 


Canada 


I2750-0997F(CA) 


00073.100 ! 


8/2/00 


Europe 


!2750-0997F(EP) 


00073.101 \ 


8/2/00 


! Mexico 


2750-0997F(MX) 


00073.102 ; 


8/2/00 


; Mexico 


;2750-0998F(MX) 


00074.102 \ 


8/3/00 


Europe 


i2750-0998F(EP) 


00074.101 : 


8/3/00 


; United States 


2750-0998P 


00074.002 I 


8/3/00 09/632,349 


Canada 


2750-0998F(CA) 


00074.100 I 


8/3/00 


United States 


2750-1 051 P 


80287.001 ! 


8/3/00 


United States 


2750-1 255P 


80299.001 I 


8/3/00 60/223,098 


United States 


2750-1 257P 


80299.003 ! 


8/3/00 


United States 


2750-1 258P 


80299.004 \ 


8/3/00 


United States 


2750-1 053P 


80288.001 ; 


8/3/00 60/223.100 


United States 


2750-1 001 P 


00079.002 I 


8/4/00 09/633,239 


United States 


2750-0999P 


00076.002 ! 


8/4/00 09/633,051 


United States 


2750-1 000P 


00077.002 


8/4/00 09/633,191 


I United States 


2750-1 078P 


80138.003 ! 


8/4/00 09/635,640 


! Mexico 


2750-0999F(MX) 


00076.102 i 


8/4/00 


Canada 


2750-1 001 F(CA) 


00079.100 ! 


8/4/00 


I Canada 


2750-0999F(CA) 


00076.100 


8/4/00 
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United States 


2750-1 092P 


80138.004 : 


8/4/00 09/635,643 


i Furone 
Europe 


2750-1 001 F(EP) 
2750-0999F(EP) 


00079.101 \ 
00076.101 ! 


8/4/00 1 
8/4/00 


Europe 


12750-1 000F(EP) 


00077.101 


8/4/00 


\ Mexico 


2750-1001 F(MX) 


00079.102 I 


8/4/00 


: OCll IClUCt 

Mexico 


2750-1 000F(MX) 


00077 100 ; 
00077.102 7 


8/4/00 

\Jf if w w 

8/4/00 








8/7/00 60/223,329 


^ Mexico 


2750-1 002F(MX) 


00080.102 ! 


8/9/00 


Europe 

1 United States 


2750-1 002F(EP) 
2750-1 002P 


00080.101 ! 
00080.002 ! 


8/9/00 

8/9/00 09/635,277 










1 JnitpH fitatpQ 

w 1 1 1 L\Z7VJ O LCllCJO 


: 2750-1 1 15P 

: / \J\J 1 1 1 1 


92002 004 ' 


8/9/00 


United States 
Canada 


2750-1 11 4P 
2750-1 002F(CA) 


92001.004 I 
00080.100 : 


8/9/00 60/224,390 
8/9/00 


United States 


2750-1 093P 


80139.003 i 


8/10/00 09/635,641 


Europe 


2750-1 004F(EP) 


00082.101 ; 


8/11/00 


i Canada 


2750-1 004F(CA) 


00082.100 ; 


8/11/00 


Mexico 


2750-1 005F(MX) 


00083.102 : 


8/11/00 


United States 


2750-1 003P 


00081 002 


8/11/00 09/637,837 


United States 

; \J 1 1 1 L C U O LCI ICf O 


2750-1 102P 
2750-1 1 04P 
2750-1 003F(MX) 


80141.006 I 
80141 008 ^ 
00081.102 i 


8/11/00 09/637,565 
8/1 1 /00 


^ Eurooe 


2750-1 003F(EP) 


00081.101 \ 


8/11/00 


OcU IdUcl 

United States 


P7S0-1 OO^FfCA^ 
2750-1 103P 


00081 100 ^ 
80141.007 : 


8/1 1 /00 

8/11/00 09/637,564 


United States 


2750-11 01 P 
2750-1 005F(CA) 


80141 .005 [ 
00083.100 \ 


8/11/00 09/637,820 
8/11/00 


United States 
United States 


2750-1 005P 
2750-1 096P 


00083.002 ; 
180142.004 i 


8/11/00 09/637,563 
8/11/00 09/637,780 




12750-1 004F(MX) 


00082.102 ; 


8/11/00 


United States 


2750-1 083P 


91084.001 i 


8/14/00 


• Fi ironp 

United States 


2750-1 005F(EP) 
2750-1 082P 


00083.101 I 
80298.001 \ 


8/14/00 03/069,150 
8/14/00 


United States 


2750-1 085P 


191085.001 ! 


8/15/00 


l JnitpH States 


12750-1 084P 


80300 001 


8/1 5/00 


United States 
United States 


2750-1 095P 
2750-1 108P 


80142.003 i 
80295.002 I 


8/16/00 09/643,672 
8/16/00 






80296.002 \ 


8/16/00 60/225,850 


United States 


2750-1 1 12P 


80297.002 ! 


8/16/00 


United States 


2750-1 106P 
2750-1 006F(MX) 


80294.002 i 
00084.102 \ 


8/16/00 
8/17/00 




2750-1 006F(CA) 


00084.100 \ 


8/17/00 


Europe 


2750-1 006F(EP) 


00084.101 i 


8/17/00 


United States 
United States 


2750-1 006P 
2750-1 009P 


00084.002 ) 
00087.002 \ 


8/17/00 09/641,198 
8/18/00 09/640,695 




2750-1 098P 


80143.004 ! 


8/18/00 09/643,671 


mexico 


2750-1 009F(MX) 


00087.102 


8/18/00 


United States 


2750-1 007P 


00085.002 ; 
00087.101 I 


8/18/00 09/641,359 
8/18/00 



7\ -|— 4- /—\ v* t^i d \ 7 TvT 

uorncy inu. 
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United States 
Eurooe 


2750-1 1 07P 
2750-1 607F(EP) 


80295.001 ! 
00085.101 : 
00087.100 ! 


8/18/00 
8/18/00 
8/18/00 






80296.001 ! 


8/18/00 60/226,323 


United States 
l - Mexico 

: IVIw/\l W 


2750-1 1 05P 
2750-1 007F(MX) 
2750-1 007F(CA) 


80294.001 i 
00085.102 | 
00085.100 ! 


8/18/00 60/226,325 

8/18/00 

8/18/00 






80297.001 ! 


8/18/00 60/226,324 




2750-1 008F(MX) 


00086.102 


8/18/00 


United States 


2750-1 008P 


00086.002 
00086.100 I 


8/18/00 09/641,375 
8/18/00 






00086.101 j 


8/18/00 






80301 .001 I 


8/21/00 


United States 
Canada 


2750-1 087P 
2750-1 01 0F(CA) 


91086.001 : 
00088.100 ! 


8/21/00 
8/23/00 
8/23/00 




2750-1 01 0F(MX) 


00088.102 i 


8/23/00 


; United States 
United States 
United States 


2750-1 123P 
2750-1 1 32P 
2750-1 097P 
2750-1 1 22P 


80289.008 ! 
80289.01 7 ! 
80143.003 ! 
80289.007 \ 


8/23/00 
8/23/00 

8/23/00 09/649.866 
8/23/00 


United States 


2750-1 11 8P 
2750-1 117P 


80289.003 i 
80289.002 ! 


8/23/00 
8/23/00 






80289.001 ! 


8/23/00 








8/23/00 


United States 


2750-1 163P 


91087.001 : 
80307.001 \ 


8/23/00 
8/23/00 


United States 
United States 
United States 
United States 


2750-1 01 OP 
2750-1 1 24P 
2750-1 1 79P 
2750-1 178P 


00088.002 ! 
80289.009 \ 
80289.053 I 
80289.052 \ 


8/23/00 09/643.854 
8/23/00 

8/25/00 60/228.151 
8/25/00 60/228,159 








8/25/00 60/228,208 


United States 


2750-1 177P 


80289.051 I 
80289.041 I 


8/25/00 60/228,320 
8/25/00 60/228,047 


















United States 
[United States 


2750-1 175P 

: tiff f \^ \r ■ ■ # ■ 

2750-1 183P 


80289.049 I 
80289.057 I 


8/25/00 60/228.323 
8/25/00 60/228,179 
8/25/00 60/228,180 




2750-1 1 85P 


80289.059 i 


8/25/00 60/228,209 


United States 


2750-1 1 86P 
2750-1 182P 


80289.060 
80289.056 


8/25/00 60/228,178 
8/25/00 60/228,153 














80289.019 


8/25/00 


[United States 


2750-1 135P 


80289.020 
80289.021 


8/25/00 
8/25/00 


United States 
United States 
| United States 


2750-1 137P 
2750-1 143P 
2750-11 44 P 


80289.022 
80289.028 
80289.029 


8/25/00 
8/25/00 
8/25/00 



Attorney No. ^^0-1243P 

51 



Country 4 P Attorney No. I Client No. 1 E Filed I Application No 



United States 2750-1 140P 80289.025 i 8/25/00 

United States 2750-21 31 P 80289.216 ! 8/25/00 60/228,037 

United States 2750-1 172P 80289.046 \ 8/25/00 60/228,135 

United States 2750-11 71 P 80289.045 \ 8/25/00 60/228,152 

United States 2750-1 170P 80289.044 ! 8/25/00 60/228,132 

United States 2750-1 1 87P 80289.061 ! 8/25/00 60/228,177 

I United States 2750-1 165P 80289.039 I 8/25/00 60/228,046 

United States 2750-1 174P 80289.048 \ 8/25/00 60/228,156 

United States 2750-21 30P 80289.215 \ 8/25/00 60/227,777 

United States 2750-1 166P 80289.040 i 8/25/00 60/228,098 

United States 2750-1 120P 80289.005 ! 8/25/00 

Mexico 2750-1 01 3F(MX) 00091.102 \ 8/25/00 

Europe 2750-1 01 3F(EP) 00091.101 8/25/00 

United States 2750-1 147P 80289.032 . 8/25/00 

United States 2750-1 01 3P 00091.002 j 8/25/00 09/645,441 

United States 2750-1 1 46P 80289.031 \ 8/25/00 

United States 2750-1 173P 80289.047 j 8/25/00 60/228,322 

United States 2750-1 168P 80289.042 \ 8/25/00 60/228,052 

United States 2750-1 169P 80289.043 ! 8/25/00 60/228,049 

United States 2750-1 152P 80289.037 ! 8/25/00 

United States 2750-11 51 P 80289.036 i 8/25/00 

United States 2750-1 1 64P 80289.038 8/25/00 60/228,163 

Canada 2750-1 01 3F(CA) 00091.100 j 8/25/00 

United States 2750-1 21 9P 80289.093 \ 8/25/00 60/228,043 

Mexico 2750-1 01 2F(MX) 00090.102 ! 8/25/00 

United States 2750-1 142P 80289.027 8/25/00 60/227,733 

United States 2750-1 138P 80289.023 \ 8/25/00 60/227,729 

United States 2750-1 128P 80289.013 j 8/25/00 60/227,726 

United States 2750-1 139P 80289.024 j 8/25/00 60/228,167 

United States 2750-1 222P 80289.096 \ 8/25/00 60/228,061 

United States 2750-1 148P 80289.033 I 8/25/00 

United States 2750-1 220P 80289.094 \ 8/25/00 60/227,931 

United States 2750-1 119P 80289.004 i 8/25/00 60/228,029 

United States 2750-1 21 8P 80289.092 \ 8/25/00 60/227,929 

United States 2750-1 21 7P 80289.091 \ 8/25/00 60/228,217 

United States 2750-1 21 6P 80289.090 ■ 8/25/00 60/227.954 

United States :2750-1012P 00090.002 ! 8/25/00 09/648,708 

Canada 2750-1 01 2F(CA) 00090.100 ' 8/25/00 

United States 2750-1 099P 80144.003 ! 8/25/00 09/649,868 

United States 2750-1 221 P 80289.095 \ 8/25/00 60/228,187 

United States 2750-11 OOP 80144.004 \ 8/25/00 09/649,867 

United States 2750-21 27P 80289.212 \ 8/25/00 60/228,032 

United States 2750-21 25P 80289.210 ! 8/25/00 60/228,039 

United States 2750-21 24P 80289.209 \ 8/25/00 60/227,790 

United States 2750-21 23P 80289.208 \ 8/25/00 60/228,036 

United States 2750-21 22P 80289.207 i 8/25/00 60/228,038 

United States 2750-21 19P 80289.204 | 8/25/00 60/228,027 

United States 2750-11 41 P 80289.026 ! 8/25/00 

United States 2750-21 17P 80289.202 : 8/25/00 60/228,031 

United States 2750-1 21 5P 80289.089 \ 8/25/00 60/227,979 

United States 2750-1 133P 80289.018 ; 8/25/00 
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Mexico 2750-1 01 9F(MX) 00098.102 \ 9/13/00 

United States 2750-1 230P 91090.001 ; 9/13/00 

United States 2750-1 01 9P 00098.002 j 9/13/00 09/660,883 

Europe 2750-1 01 9F(EP) 00098.101 j 9/13/00 

United States 2750-1 231 P 8031 1 .001 i 9/15/00 

Canada 2750-1 020F(CA) 00099.1 00 j 9/15/00 

United States 2750-1 020P 00099,002 j 9/15/00 

United States 2750-1 021 P 001 01 ,002 j 9/15/00 

United States 2750-1 232P 91091.001 j 9/15/00 

Canada 2750-1 021 F(CA) 00101.100 j 9/15/00 

Europe 2750-1 020F(EP) 00099.101 ! 9/15/00 

Mexico 2750-1 021 F(MX) 00101.102 \ 9/15/00 

Mexico 2750-1 020F(MX) 00099.102 j 9/15/00 

Europe 2750-1 021 F(EP) 00101.101 j 9/15/00 

United States 2750-1 234P 91092.001 i 9/18/00 

United States 2750-1 233P 80312.001 ; 9/18/00 

United States 2750-1 253P 80313.001 j 9/20/00 

United States 2750-1 254P 91093.001 I 9/20/00 

Canada 2750-1 022F(CA) 00102,100 j 9/20/00 

Mexico 2750-1 022F(MX) 00102.102 j 9/20/00 

Europe 2750-1 022F(EP) 00102.101 \ 9/20/00 

United States 2750-1 022P 00102.002 \ 9/20/00 

United States 12750-1 261 P 91095,001 ! 9/21/00 

United States 2750-1 259P 80314,001 j 9/21/00 

United States 2750-1 260P 91094.001 \ 9/21/00 

Canada 2750-1 024F(CA) 00104.100 j 9/22/00 
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All applications listed in the table above are expressly incorporated herein by 
reference. 

The SDFs of the invention can also be used as probes to search for genes that 
are related to the SDF within a species. Such related genes are typically considered to 
be members of a gene family. In such a case, the sequence similarity will often be 
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concentrated into one or a few fragments of the sequence. The fragments of similar 
sequence that define the gene family typically encode a fragment of a protein or RNA 
that has an enzymatic or structural function. The percentage of identity in the amino 
acid sequence of the domain that defines the gene family is preferably at least 70%, 
more preferably 80 to 95%, most preferably 85 to 99%. To search for members of a 
gene family within a species, a low stringency hybridization is usually performed, but 
this will depend upon the size, distribution and degree of sequence divergence of 
domains that define the gene family. SDFs encompassing regulatory regions can be 
used to identify coordinately expressed genes by using the regulatory region sequence 
of the SDF as a probe. 

In the instances where the SDFs are identified as being expressed from genes that 
confer a particular phenotype, then the SDFs can also be used as probes to assay plants 
of different species for those phenotypes. 

I.C. Methods to Inhibit Gene Expression 

The nucleic acid molecules of the present invention can be used to inhibit gene 
transcription and/or translation. Example of such methods include, without limitation: 
Antisense Constructs; 
Ribozyme Constructs; 
Chimeraplast Constructs; 
Co-Suppression; 
Transcriptional Silencing; and 
Other Methods of Gene Expression. 

C.l Antisense 

In some instances it is desirable to suppress expression of an endogenous or 
exogenous gene. A well-known instance is the FLAVOR-SAVOR™ tomato, in 
which the gene encoding ACC synthase is inactivated by an antisense approach, thus 
delaying softening of the fruit after ripening. See for example, U.S. Patent No. 
5,859,330; U.S. Patent No. 5,723,766; Oeller, et al, Science, 254:437-439(1991); and 
Hamilton et al, Nature, 346:284-287 (1990). Also, timing of flowering can be 
controlled by suppression of the FLOWERING LOCUS C (FLC); high levels of this 
transcript are associated with late flowering, while absence of FLC is associated with 
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early flowering (S.D. Michaels et al., Plant Cell 11:949 (1999). Also, the transition of 



regulated by TERMINAL FLOWER1, APETALA1 and LEAFY . Thus, when it is 
desired to induce a transition from shoot production to flowering, it is desirable to 
suppress TFL1 expression (SJ. Liljegren, Plant Cell 11:1007 (1999)). As another 
instance, arrested ovule development and female sterility result from suppression of 
the ethylene forming enzyme but can be reversed by application of ethylene (D. De 
Martinis et al., Plant Cell 11:1061 (1999)). The ability to manipulate female fertility 
of plants is useful in increasing fruit production and creating hybrids. 

In the case of polynucleotides used to inhibit expression of an endogenous gene, 
the introduced sequence need not be perfectly identical to a sequence of the target 
endogenous gene. The introduced polynucleotide sequence will typically be at least 
substantially identical to the target endogenous sequence. 

Some polynucleotide SDFs in the REF and SEQ Tables represent sequences 
that are expressed in corn,wheat, rice, soybean Arabidopsis and/or other plants. Thus 
the invention includes using these sequences to generate antisense constructs to 
inhibit translation and/or degradation of transcripts of said SDFs, typically in a plant 
cell. 

To accomplish this, a polynucleotide segment from the desired gene that can 
hybridize to the mRNA expressed from the desired gene (the antisense segment") is 
operably linked to a promoter such that the antisense strand of RNA will be transcribed 
when the construct is present in a host cell. A regulated promoter can be used in the 
construct to control transcription of the antisense segment so that transcription occurs 
only under desired circumstances. 

The antisense segment to be introduced generally will be substantially identical 
to at least a fragment of the endogenous gene or genes to be repressed. The sequence, 
however, need not be perfectly identical to inhibit expression. Further, the antisense 
product may hybridize to the untranslated region instead of or in addition to the coding 
sequence of the gene. The vectors of the present invention can be designed such that the 
inhibitory effect applies to other proteins within a family of genes exhibiting homology 
or substantial homology to the target gene. 

For antisense suppression, the introduced antisense segment sequence also 
need not be full length relative to either the primary transcription product or the fully 
processed mRNA. Generally, a higher percentage of sequence identity can be used to 



apical meristem from production of leaves with associated shoots to flowering is 
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compensate for the use of a shorter sequence. Furthermore, the introduced sequence 
need not have the same intron or exon pattern, and homology of non-coding segments 
may be equally effective. Normally, a sequence of between about 30 or 40 
nucleotides and the full length of the transcript canbe used, though a sequence of at 
least about 100 nucleotides is preferred, a sequence of at least about 200 nucleotides 
is more preferred, and a sequence of at least about 500 nucleotides is especially 
preferred. 

C.2. Ribozymes 

It is also contemplated that gene constructs representing ribozymes and based 
on the SDFs in the REF and SEQ Tables are an object of the invention. Ribozymes 
can also be used to inhibit expression of genes by suppressing the translation of the 
mRNA into a polypeptide. It is possible to design ribozymes that specifically pair with 
virtually any target RNA and cleave the phosphodiester backbone at a specific location, 
thereby functionally inactivating the target RNA. In carrying out this cleavage, the 
ribozyme is not itself altered, and is thus capable of recycling and cleaving other 
molecules, making it a true enzyme. The inclusion of ribozyme sequences within 
antisense RNAs confers RNA-cleaving activity upon them, thereby increasing the 
activity of the constructs. 

A number of classes of ribozymes have been identified. One class of 
ribozymes is derived from a number of small circular RNAs, which are capable of 
self-cleavage and replication in plants. The RNAs replicate either alone (viroid 
RNAs) or with a helper virus (satellite RNAs). Examples include RNAs from 
avocado sunblotch viroid and the satellite RNAs from tobacco ringspot virus, lucerne 
transient streak virus, velvet tobacco mottle virus, solanum nodiflorum mottle virus 
and subterranean clover mottle virus. The design and use of target RNA-specific 
ribozymes is described in Haseloff et al. Nature, 334:585 (1988). 

Like the antisense constructs above, the ribozyme sequence fragment 
necessary for pairing need not be identical to the target nucleotides to be cleaved, nor 
identical to the sequences in the REF and SEQ Tables. Ribozymes may be 
constructed by combining the ribozyme sequence and some fragment of the target 
gene which would allow recognition of the target gene mRNA by the resulting 
ribozyme molecule. Generally, the sequence in the ribozyme capable of binding to 
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the target sequence exhibits a percentage of sequence identity with at least 80%, 
preferably with at least 85%, more preferably with at least 90% and most preferably with 
at least 95%, even more preferably, with at least 96%, 97%, 98% or 99% sequence 
identity to some fragment of a sequence in the REF and SEQ Tables or the 
complement thereof. The ribozyme can be equally effective in inhibiting mRNA 
translation by cleaving either in the untranslated or coding regions. Generally, a 
higher percentage of sequence identity can be used to compensate for the use of a 
shorter sequence. Furthermore, the introduced sequence need not have the same intron 
or exon pattern, and homology of non-coding segments may be equally effective. 

C.3. Chimeraplasts 

The SDFs of the invention, such as those described by the REF and SEQ 
Tables, can also be used to construct chimeraplasts that can be introduced into a cell 
to produce at least one specific nucleotide change in a sequence corresponding to the 
SDF of the invention. A chimeraplast is an oligonucleotide comprising DNA and/or 
RNA that specifically hybridizes to a target region in a manner which creates a 
mismatched base-pair. This mismatched base-pair signals the cell's repair enzyme 
machinery which acts on the mismatched region resulting in the replacement, 
insertion or deletion of designated nucleotide(s). The altered sequence is then 
expressed by the cell's normal cellular mechanisms. Chimeraplasts can be designed 
to repair mutant genes, modify genes, introduce site-specific mutations, and/or act to 
interrupt or alter normal gene function (US Pat. Nos. 6,010,907 and 6,004,804; and 
PCT Pub. No. W099/58723 and WO99/07865). 

C.4. Sense Suppression 

The SDFs of the REF and SEQ Tables of the present invention are also useful 
to modulate gene expression by sense suppression. Sense suppression represents 
another method of gene suppression by introducing at least one exogenous copy or 
fragment of the endogenous sequence to be suppressed. 

Introduction of expression cassettes in which a nucleic acid is configured in the 
sense orientation with respect to the promoter into the chromosome of a plant or by a 
self-replicating virus has been shown to be an effective means by which to induce 
degradation of mRNAs of target genes. For an example of the use of this method to 
modulate expression of endogenous genes see, Napoli et al., The Plant Cell 2:279 



Attorney No 



-1243P 



60 




(1990), and U.S. Patents Nos. 5,034,323, 5,231,020, and 5,283,184. Inhibition of 
expression may require some transcription of the introduced sequence. 

For sense suppression, the introduced sequence generally will be substantially 
identical to the endogenous sequence intended to be inactivated. The minimal 
percentage of sequence identity will typically be greater than about 65%, but a higher 
percentage of sequence identity might exert a more effective reduction in the level of 
normal gene products. Sequence identity of more than about 80% is preferred, though 
about 95% to absolute identity would be most preferred. As with antisense regulation, 
the effect would likely apply to any other proteins within a similar family of genes 
exhibiting homology or substantial homology to the suppressing sequence. 

C.5. Transcriptional Silencing 

The nucleic acid sequences of the invention, including the SDFs of the REF and 
SEQ Tables, and fragments thereof, contain sequences that can be inserted into the 
genome of an organism resulting in transcriptional silencing. Such regulatory sequences 
need not be operatively linked to coding sequences to modulate transcription of a gene. 
Specifically, a promoter sequence without any other element of a gene can be introduced 
into a genome to transcriptionally silence an endogenous gene (see, for example, 
Vaucheret, H et al. (1998) The Plant Journal 16: 651-659). As another example, triple 
helices can be formed using oligonucleotides based on sequences from the REF and 
SEQ Tables, fragments thereof, and substantially similar sequence thereto. The 
oligonucleotide can be delivered to the host cell and can bind to the promoter in the 
genome to form a triple helix and prevent transcription. An oligonucleotide of interest is 
one that can bind to the promoter and block binding of a transcription factor to the 
promoter. In such a case, the oligonucleotide can be complementary to the sequences of 
the promoter that interact with transcription binding factors. 



C.6. Other Methods to Inhibit Gene Expression 

Yet another means of suppressing gene expression is to insert a polynucleotide 
into the gene of interest to disrupt transcription or translation of the gene. 

Low frequency homologous recombination can be used to target a 
polynucleotide insert to a gene by flanking the polynucleotide insert with sequences that 
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are substantially similar to the gene to be disrupted. Sequences from the REF and SEQ 
Tables, fragments thereof, and substantially similar sequence thereto can be used for 
homologous recombination. 

In addition, random insertion of polynucleotides into a host cell genome can also 
be used to disrupt the gene of interest. Azpiroz-Leehan et ah, Trends in Genetics 13:152 
(1997). In this method, screening for clones from a library containing random insertions 
is preferred to identifying those that have polynucleotides inserted into the gene of 
interest. Such screening can be performed using probes and/or primers described above 
based on sequences from the REF and SEQ Tables, fragments thereof, and substantially 
similar sequence thereto. The screening can also be performed by selecting clones or Ri 
plants having a desired phenotype. 

ID. Methods of Functional Analysis 

The constructs described in the methods under I.C. above can be used to 
determine the function of the polypeptide encoded by the gene that is targeted by the 
constructs. 

Down-regulating the transcription and translation of the targeted gene in the 
host cell or organisms, such as a plant, may produce phenotypic changes as compared 
to a wild-type cell or organism. In addition, in vitro assays can be used to determine 
if any biological activity, such as calcium flux, DNA transcription, nucleotide 
incorporation, etc., are being modulated by the down-regulation of the targeted gene. 

Coordinated regulation of sets of genes, e.g., those contributing to a desired 
polygenic trait, is sometimes necessary to obtain a desired phenotype. SDFs of the 
invention representing transcription activation and DNA binding domains can be 
assembled into hybrid transcriptional activators. These hybrid transcriptional 
activators can be used with their corresponding DNA elements (i.e., those bound by 
the DNA-binding SDFs) to effect coordinated expression of desired genes (J.J. 
Schwarz et al., Mol Cell Biol 12:266 (1992), A. Martinez et al., Mol Gen. Genet 
261:546 (1999)). 

The SDFs of the invention can also be used in the two-hybrid genetic systems 
to identify networks of protein-protein interactions (L. McAlister-Henn et al., 
Methods 12:330 (1999), J.C. Hu et al., Methods 20:80 (2000), M. Golovkin et al., /. 
Biol Chem. 274:36428 (1999), K. Ichimura et al., Biochem. Biophys. Res. Comm. 
253 :532 (1998)). The SDFs of the invention can also be used in various expression 
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display methods to identify important protein-DNA interactions (e.g. B. Luo et al., J. 
Mol Biol 266:479 (1997)). 



I.E. Promoters 

The SDFs of the invention are also useful as structural or regulatory sequences in 
a construct for modulating the expression of the corresponding gene in a plant or other 
organism, e.g. a symbiotic bacterium. For example, promoter sequences associated to 
SDFs of the REF and SEQ Tables of the present invention can be useful in directing 
expression of coding sequences either as constitutive promoters or to direct expression in 
particular cell types, tissues, or organs or in response to environmental stimuli. 

With respect to the SDFs of the present invention a promoter is likely to be a 
relatively small portion of a genomic DNA (gDNA) sequence located in the first 2000 
nucleotides upstream from an initial exon identified in a gDNA sequence or initial 
ATG" or methionine codon or translational start site in a corresponding cDNA 
sequence. Such promoters are more likely to be found in the first 1000 nucleotides 
upstream of an initial ATG or methionine codon or translational start site of a cDNA 
sequence corresponding to a gDNA sequence. In particular, the promoter is usually 
located upstream of the transcription start site. The fragments of a particular gDNA 
sequence that function as elements of a promoter in a plant cell will preferably be found 
to hybridize to gDNA sequences presented and described in the REF and SEQ Tables at 
medium or high stringency, relevant to the length of the probe and its base composition. 

Promoters are generally modular in nature. Promoters can consist of a basal 
promoter that functions as a site for assembly of a transcription complex comprising an 
RNA polymerase, for example RNA polymerase II. A typical transcription complex will 
include additional factors such as TF n B, TF n D, and TF U E. Of these, TF n D appears to be 
the only one to bind DNA directly. The promoter might also contain one or more 
enhancers and/or suppressors that function as binding sites for additional transcription 
factors that have the function of modulating the level of transcription with respect to 
tissue specificity and of transcriptional responses to particular environmental or 
nutritional factors, and the like. 

Short DNA sequences representing binding sites for proteins can be separated 
from each other by intervening sequences of varying length. For example, within a 
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particular functional module, protein binding sites may be constituted by regions of 5 to 
60, preferably 10 to 30, more preferably 10 to 20 nucleotides. Within such binding sites, 
there are typically 2 to 6 nucleotides that specifically contact amino acids of the nucleic 
acid binding protein. The protein binding sites are usually separated from each other by 
10 to several hundred nucleotides, typically by 15 to 150 nucleotides, often by 20 to 50 
nucleotides. DNA binding sites in promoter elements often display dyad symmetry in 
their sequence. Often elements binding several different proteins, and/or a plurality of 
sites that bind the same protein, will be combined in a region of 50 to 1,000 basepairs. 

Elements that have transcription regulatory function can be isolated from their 
corresponding endogenous gene, or the desired sequence can be synthesized, and 
recombined in constructs to direct expression of a coding region of a gene in a desired 
tissue-specific, temporal-specific or other desired manner of inducibility or suppression. 
When hybridizations are performed to identify or isolate elements of a promoter by 
hybridization to the long sequences presented in the REF and SEQ Tables, conditions 
are adjusted to account for the above-described nature of promoters. For example short 
probes, constituting the element sought, are preferably used under low temperature 
and/or high salt conditions. When long probes, which might include several promoter 
elements are used, low to medium stringency conditions are preferred when hybridizing 
to promoters across species. 

If a nucleotide sequence of an SDF, or part of the SDF, functions as a 
promoter or fragment of a promoter, then nucleotide substitutions, insertions or 
deletions that do not substantially affect the binding of relevant DNA binding proteins 
would be considered equivalent to the exemplified nucleotide sequence. It is 
envisioned that there are instances where it is desirable to decrease the binding of 
relevant DNA binding proteins to silence or down-regulate a promoter, or conversely 
to increase the binding of relevant DNA binding proteins to enhance or up-regulate a 
promoter and vice versa. In such instances, polynucleotides representing changes to 
the nucleotide sequence of the DNA-protein contact region by insertion of additional 
nucleotides, changes to identity of relevant nucleotides, including use of chemically- 
modified bases, or deletion of one or more nucleotides are considered encompassed 
by the present invention. In addition, fragments of the promoter sequences described 
by the REF and SEQ Tables and variants thereof can be fused with other promoters or 
fragments to facilitate transcription and/or transcription in specific type of cells or 
under specific conditions. 



Attorney No. 



,0-1243? 



64 




Promoter function can be assayed by methods known in the art, preferably by 
measuring activity of a reporter gene operatively linked to the sequence being tested 
for promoter function. Examples of reporter genes include those encoding luciferase, 
green fluorescent protein, GUS, neo, cat and bar. 

I.F. UTRs and Junctions 

Polynucleotides comprising untranslated (UTR) sequences and intron/exon 
junctions are also within the scope of the invention. UTR sequences include introns and 
5' or y untranslated regions (5' UTRs or 3' UTRs). Fragments of the sequences 
shown in the REF and SEQ Tables can comprise UTRs and intron/exon junctions. 

These fragments of SDFs, especially UTRs, can have regulatory functions 
related to, for example, translation rate and mRNA stability. Thus, these fragments of 
SDFs can be isolated for use as elements of gene constructs for regulated production of 
polynucleotides encoding desired polypeptides. 

Introns of genomic DNA segments might also have regulatory functions. 
Sometimes regulatory elements, especially transcription enhancer or suppressor 
elements, are found within introns. Also, elements related to stability of 
heteronuclear RNA and efficiency of splicing and of transport to the cytoplasm for 
translation can be found in intron elements. Thus, these segments can also find use as 
elements of expression vectors intended for use to transform plants. 

Just as with promoters UTR sequences and intron/exon junctions can vary 
from those shown in the REF and SEQ Tables. Such changes from those sequences 
preferably will not affect the regulatory activity of the UTRs or intron/exon junction 
sequences on expression, transcription, or translation unless selected to do so. 
However, in some instances, down- or up-regulation of such activity may be desired 
to modulate traits or phenotypic or in vitro activity. 

I.G. Coding Sequences 

Isolated polynucleotides of the invention can include coding sequences that 
encode polypeptides comprising an amino acid sequence encoded by sequences in the 
REF and SEQ Tables or an amino acid sequence presented in the REF and SEQ 
Tables. 

A nucleotide sequence encodes a polypeptide if a cell (or a cell free in vitro 
system) expressing that nucleotide sequence produces a polypeptide having the 
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recited amino acid sequence when the nucleotide sequence is transcribed and the 
primary transcript is subsequently processed and translated by a host cell (or a cell 
free in vitro system) harboring the nucleic acid. Thus, an isolated nucleic acid that 
encodes a particular amino acid sequence can be a genomic sequence comprising 
exons and introns or a cDNA sequence that represents the product of splicing thereof. 
An isolated nucleic acid encoding an amino acid sequence also encompasses 
heteronuclear RNA, which contains sequences that are spliced out during expression, 
and mRNA, which lacks those sequences. 

Coding sequences can be constructed using chemical synthesis techniques or 
by isolating coding sequences or by modifying such synthesized or isolated coding 
sequences as described above. 

In addition to coding sequences encoding the polypeptide sequences of the 
REF and SEQ Tables, which are native to corn, Arabidopsis, soybean, rice, wheat, 
and other plants the isolated polynucleotides can be polynucleotides that encode 
variants, fragments, and fusions of those native proteins. Such polypeptides are 
described below in part II. 

In variant polynucleotides generally, the number of substitutions, deletions or 
insertions is preferably less than 20%, more preferably less than 15%; even more 
preferably less than 10%, 5%, 3% or 1% of the number of nucleotides comprising a 
particularly exemplified sequence. It is generally expected that non-degenerate 
nucleotide sequence changes that result in 1 to 10, more preferably 1 to 5 and most 
preferably 1 to 3 amino acid insertions, deletions or substitutions will not greatly affect 
the function of an encoded polypeptide. The most preferred embodiments are those 
wherein 1 to 20, preferably 1 to 10, most preferably 1 to 5 nucleotides are added to, 
deleted from and/or substituted in the sequences specifically disclosed in the REF and 
SEQ Tables. 

Insertions or deletions in polynucleotides intended to be used for encoding a 
polypeptide preferably preserve the reading frame. This consideration is not so 
important in instances when the polynucleotide is intended to be used as a hybridization 
probe. 



II. Polypeptides and Proteins 

IIA. Native polypeptides and proteins 
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Polypeptides within the scope of the invention include both native proteins as 
well as variants, fragments, and fusions thereof. Polypeptides of the invention are 
those encoded by any of the six reading frames of sequences shown in the REF and 
SEQ Tables, preferably encoded by the three frames reading in the 5' to 3' direction 
of the sequences as shown. 

Native polypeptides include the proteins encoded by the sequences shown in 
the REF and SEQ Tables. Such native polypeptides include those encoded by allelic 
variants. 

Polypeptide and protein variants will exhibit at least 75% sequence identity to 
those native polypeptides of the REF and SEQ Tables. More preferably, the polypeptide 
variants will exhibit at least 85% sequence identity; even more preferably, at least 90% 
sequence identity; more preferably at least 95%, 96%, 97%, 98%, or 99% sequence 
identity. Fragments of polypeptide or fragments of polypeptides will exhibit similar 
percentages of sequence identity to the relevant fragments of the native polypeptide. 
Fusions will exhibit a similar percentage of sequence identity in that fragment of the 
fusion represented by the variant of the native peptide. 

Furthermore, polypeptide variants will exhibit at least one of the functional 
properties of the native protein. Such properties include, without limitation, protein 
interaction, DNA interaction, biological activity, immunological activity, receptor 
binding, signal transduction, transcription activity, growth factor activity, secondary 
structure, three-dimensional structure, etc. As to properties related to in vitro or in vivo 
activities, the variants preferably exhibit at least 60% of the activity of the native protein; 
more preferably at least 70%, even more preferably at least 80%, 85%, 90% or 95% of at 
least one activity of the native protein. 

One type of variant of native polypeptides comprises amino acid substitutions, 
deletions and/or insertions. Conservative substitutions are preferred to maintain the 
function or activity of the polypeptide. 

Within the scope of percentage of sequence identity described above, a 
polypeptide of the invention may have additional individual amino acids or amino acid 
sequences inserted into the polypeptide in the middle thereof and/or at the N-terminal 
and/or C-terminal ends thereof. Likewise, some of the amino acids or amino acid 
sequences may be deleted from the polypeptide. 
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A.l Antibodies 

Isolated polypeptides can be utilized to produce antibodies. Polypeptides of the 
invention can generally be used, for example, as antigens for raising antibodies by 
known techniques. The resulting antibodies are useful as reagents for determining the 
distribution of the antigen protein within the tissues of a plant or within a cell of a plant. 
The antibodies are also useful for examining the production level of proteins in various 
tissues, for example in a wild-type plant or following genetic manipulation of a plant, by 
methods such as Western blotting. 

Antibodies of the present invention, both polyclonal and monoclonal, may be 
prepared by conventional methods. In general, the polypeptides of the invention are first 
used to immunize a suitable animal, such as a mouse, rat, rabbit, or goat. Rabbits and 
goats are preferred for the preparation of polyclonal sera due to the volume of serum 
obtainable, and the availability of labeled anti-rabbit and anti-goat antibodies as 
detection reagents. Immunization is generally performed by mixing or emulsifying the 
protein in saline, preferably in an adjuvant such as Freund's complete adjuvant, and 
injecting the mixture or emulsion parenterally (generally subcutaneously or 
intramuscularly). A dose of 50-200 jig/injection is typically sufficient. Immunization is 
generally boosted 2-6 weeks later with one or more injections of the protein in saline, 
preferably using Freund's incomplete adjuvant. One may alternatively generate 
antibodies by in vitro immunization using methods known in the art, which for the 
purposes of this invention is considered equivalent to in vivo immunization. 

Polyclonal antisera is obtained by bleeding the immunized animal into a glass or 
plastic container, incubating the blood at 25°C for one hour, followed by incubating the 
blood at 4°C for 2-18 hours. The serum is recovered by centrifugation (e.g., l,000xg for 
10 minutes). About 20-50 ml per bleed may be obtained from rabbits. 

Monoclonal antibodies are prepared using the method of Kohler and Milstein, 
Nature 256: 495 (1975), or modification thereof. Typically, a mouse or rat is 
immunized as described above. However, rather than bleeding the animal to extract 
serum, the spleen (and optionally several large lymph nodes) is removed and dissociated 
into single cells. If desired, the spleen cells can be screened (after removal of 
nonspecifically adherent cells) by applying a cell suspension to a plate, or well, coated 
with the protein antigen. B-cells producing membrane-bound immunoglobulin specific 
for the antigen bind to the plate, and are not rinsed away with the rest of the suspension. 
Resulting B-cells, or all dissociated spleen cells, are then induced to fuse with myeloma 
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cells to form hybridomas, and are cultured in a selective medium (e.g., hypoxanthine, 
aminopterin, thymidine medium, HAT"). The resulting hybridomas are plated by 
limiting dilution, and are assayed for the production of antibodies which bind 
specifically to the immunizing antigen (and which do not bind to unrelated antigens). 
The selected Mab-secreting hybridomas are then cultured either in vitro {e.g., in tissue 
culture bottles or hollow fiber reactors), or in vivo (as ascites in mice). 

Other methods for sustaining antibody-producing B-cell clones, such as by EBV 

transformation, are known. 

If desired, the antibodies (whether polyclonal or monoclonal) may be labeled 
using conventional techniques. Suitable labels include fluorophores, chromophores, 
radioactive atoms (particularly 32 P and 125 I), electron-dense reagents, enzymes, and 
ligands having specific binding partners. Enzymes are typically detected by their 
activity. For example, horseradish peroxidase is usually detected by its ability to convert 
3,3',5,5'-tetramethylbenzidine (TNB) to a blue pigment, quantifiable with a 
spectrophotometer. 

A 7 Tn Vitro App»ffl ri ™™ of Polypeptides 

Some polypeptides of the invention will have enzymatic activities that are useful 
in vitro. For example, the soybean trypsin inhibitor (Kunitz) family is one of the 
numerous families of proteinase inhibitors. It comprises plant proteins which have 
inhibitory activity against serine proteinases from the trypsin and subtilisin families, 
thiol proteinases and aspartic proteinases. Thus, these peptides find in vitro use in 
protein purification protocols and perhaps in therapeutic settings requiring topical 
application of protease inhibitors. 

Delta-aminolevulinic acid dehydratase (EC 4,2.1,24) (ALAD) catalyzes the 
second step in the biosynthesis of heme, the condensation of two molecules of 5- 
aminolevulinate to form porphobilinogen and is also involved in chlorophyll 
biosynthesis(Kaczor et al. (1994) Plant Physiol. 1-4: 1411-7; Smith (1988) Biochem. 
J. 249: 423-8; Schneider (1976) Z. naturforsch. [C] 31: 55-63). Thus, ALAD proteins 
can be used as catalysts in synthesis of heme derivatives. Enzymes of biosynthetic 
pathways generally can be used as catalysts for in vitro synthesis of the compounds 
representing products of the pathway. 
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Polypeptides encoded by SDFs of the invention can be engineered to provide 
purification reagents to identify and purify additional polypeptides that bind to them. 
This allows one to identify proteins that function as multimers or elucidate signal 
transduction or metabolic pathways. In the case of DNA binding proteins, the 
polypeptide can be used in a similar manner to identify the DNA determinants of 
specific binding (S. Pierrou et al.,Anal Biochem. 229:99 (1995), S. Chusacultanachai 
et al.,J. Biol Chem. 274:23591 (1999), Q. Lin et al.,J. Biol Chem, 272:27274 



II.B . POLYPEPTIDE VARIANTS , FRAGMENTS, AND FUSIONS 
Generally, variants , fragments, or fusions of the polypeptides encoded by the 
maximum length sequence(MLS) can exhibit at least one of the activities of the 
identified domains and/or related polypeptides described in Sections (C) and (D) of the 
REF Tables corresponding to the MLS of interest. 



substitutions. Conservative substitutions, described above (see II.), are preferred to 
maintain the function or activity of the polypeptide. Such substitutions include 
conservation of charge, polarity, hydrophobicity, size, etc. For example, one or more 
amino acid residues within the sequence can be substituted with another amino acid of 
similar polarity that acts as a functional equivalent, for example providing a hydrogen 
bond in an enzymatic catalysis. Substitutes for an amino acid within an exemplified 
sequence are preferably made among the members of the class to which the amino acid 
belongs. For example, the nonpolar (hydrophobic) amino acids include alanine, leucine, 
isoleucine, valine, proline, phenylalanine, tryptophan and methionine. The polar neutral 
amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and 
glutamine. The positively charged (basic) amino acids include arginine, lysine and 
histidine. The negatively charged (acidic) amino acids include aspartic acid and 
glutamic acid. 

Within the scope of percentage of sequence identity described above, a 
polypeptide of the invention may have additional individual amino acids or amino acid 
sequences inserted into the polypeptide in the middle thereof and/or at the N-terminal 
and/or C-terminal ends thereof. Likewise, some of the amino acids or amino acid 



(1997)). 



II.B .(1) Variants 

A type of variant of the native polypeptides comprises amino acid 
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sequences may be deleted from the polypeptide. Amino acid substitutions may also be 
made in the sequences; conservative substitutions being preferred. 

One preferred class of variants are those that comprise (1) the domain 
of an encoded polypeptide and/or (2) residues conserved between the encoded 
polypeptide and related polypeptides. For this class of variants, the encoded 
polypeptide sequence is changed by insertion, deletion, or substitution at positions 
flanking the domain and/or conserved residues. 

Another class of variants includes those that comprise an encoded 
polypeptide sequence that is changed in the domain or conserved residues by a 
conservative substitution. 

Yet another class of variants includes those that lack one of the in vitro 
activities, or structural features of the encoded polypeptides. One example is 
polypeptides or proteins produced from genes comprising dominant negative 
mutations. Such a variant may comprise an encoded polypeptide sequence with non- 
conservative changes in a particular domain or group of conserved residues. 



identified for a polypeptide encoded by an MLS of the instant invention and variants 
thereof. Also, fragments that comprise at least one region of residues conserved between 
an MLS encoded polypeptide and its related polypeptides are of great interest. 
Fragments are sometimes useful as polypeptides corresponding to genes comprising 
dominant negative mutations are. 



encoded polypeptide or variants thereof of interest and (2) a fragment of a polypeptide 
comprising the same domain. For example, an AP2 helix encoded by a MLS of the 
invention fused to second AP2 helix from ANT protein, which comprises two AP2 
helices. The present invention also encompasses fusions of MLS encoded 
polypeptides, variants, or fragments thereof fused with related proteins or fragments 
thereof. 

DEFINITION OF DOMAINS 



II.A.(2) FRAGMENTS 

Fragments of particular interest are those that comprise a domain 



II.A.(3)FUSIONS 

Of interest are chimeras comprising (1) a fragment of the MLS 
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The polypeptides of the invention may possess identifying domains as shown 
in the REF Tables. Specific domains within the MLS encoded polypeptides are 
indicated in the REF Tables. In addition, the domains within the MLS encoded 
polypeptide can be defined by the region that exhibits at least 70% sequence identity 
with the consensus sequences listed in the detailed description below of each of the 
domains. 

The majority of the protein domain descriptions given in the protein domain 
table are obtained from Prosite, (http//www.expasy.ch/prosite/), and Pfam, 
(http//pfam.wustl.edu/browse.shtml). 



Polypeptides comprising signal peptides are a family of proteins that are 
typically targeted to (1) a particular organelle or intracellular compartment, (2) 
interact with a particular molecule or (3) for secretion outside of a host cell. Example 
of polypeptides comprising signal peptides include, without limitation, secreted 
proteins, soluble proteins, receptors, proteins retained in the ER, etc. 

These proteins comprising signal peptides are useful to modulate ligand- 
receptor interactions, cell-to-cell communication, signal transduction, intracellular 
communication, and activities and/or chemical cascades that take part in an organism 
outside or within of any particular cell. 

One class of such proteins are soluble proteins which are transported out of the 
cell. These proteins can act as ligands that bind to receptor to trigger signal 
transduction or to permit communication between cells. 

Another class is receptor proteins which also comprise a retention domain that 
lodges the receptor protein in the membrane when the cell transports the receptor to 
the surface of the cell. Like the soluble ligands, receptors can also modulate signal 
transduction and communication between cells. 



A. Activities of Polypeptides Comprising Signal Peptides 



In addition the signal peptide itself can serve as a ligand for some receptors. 
An example is the interaction of the ER targeting signal peptide with the signal 
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recognition particle (SRP). Here, the SRP binds to the signal peptide, halting 
translation, and the resulting SRP complex then binds to docking proteins located on 
the surface of the ER, prompting transfer of the protein into the ER. 



A description of signal peptide residue composition is described below in 
Subsection IV.C.l. 
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III. Methods of Modulating Polypeptide Production 

It is contemplated that polynucleotides of the invention can be incorporated 
into a host cell or in-vitro system to modulate polypeptide production. For instance, 
the SDFs prepared as described herein can be used to prepare expression cassettes useful 
in a number of techniques for suppressing or enhancing expression. 

An example are polynucleotides comprising sequences to be transcribed, such 
as coding sequences, of the present invention can be inserted into nucleic acid 
constructs to modulate polypeptide production. Typically, such sequences to be 
transcribed are heterologous to at least one element of the nucleic acid construct to 
generate a chimeric gene or construct. 

Another example of useful polynucleotides are nucleic acid molecules 
comprising regulatory sequences of the present invention. Chimeric genes or 
constructs can be generated when the regulatory sequences of the invention linked to 
heterologous sequences in a vector construct. Within the scope of invention are such 
chimeric gene and/or constructs. 

Also within the scope of the invention are nucleic acid molecules, whereof at 
least a part or fragment of these DNA molecules are presented in the REF and SEQ 
Tables of the present application, and wherein the coding sequence is under the control 
of its own promoter and/or its own regulatory elements. Such molecules are useful for 
transforming the genome of a host cell or an organism regenerated from said host cell for 
modulating polypeptide production. 

Additionally, a vector capable of producing the oligonucleotide can be inserted 
into the host cell to deliver the oligonucleotide. 

More detailed description of components to be included in vector constructs 
are described both above and below. 

Whether the chimeric vectors or native nucleic acids are utilized, such 
polynucleotides can be incorporated into a host cell to modulate polypeptide 
production. Native genes and/or nucleic acid molecules can be effective when 
exogenous to the host cell. 

Methods of modulating polypeptide expression includes, without limitation: 
Suppression methods, such as 
Antisense 
Ribozymes 
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Co-suppression 

Insertion of Sequences into the Gene to be Modulated 
Regulatory Sequence Modulation. 

as well as Methods for Enhancing Production, such as 
Insertion of Exogenous Sequences; and 
Regulatory Sequence Modulation. 



IILA. Suppression 

Expression cassettes of the invention can be used to suppress expression of 
endogenous genes which comprise the SDF sequence. Inhibiting expression can be 
useful, for instance, to tailor the ripening characteristics of a fruit (Oeller et al., 
Science 254:437 (1991)) or to influence seed size_(WO98/07842) or to provoke cell 
ablation (Mariani et al., Nature 357: 384-387 (1992). 

As described above, a number of methods can be used to inhibit gene 
expression in plants, such as antisense, ribozyme, introduction of exogenous genes 
into a host cell, insertion of a polynucleotide sequence into the coding sequence 
and/or the promoter of the endogenous gene of interest, and the like. 

III. A. 1. Antisense 

An expression cassette as described above can be transformed into host 
cell or plant to produce an antisense strand of RNA. For plant cells, antisense RNA 
inhibits gene expression by preventing the accumulation of mRNA which encodes the 
enzyme of interest, se% e.g., Sheehy et al., Proc. Nat. Acad. ScL USA, 85:8805 (1988), 
and Hiatt et al., U.S. Patent No. 4,801,340. 



III.A.2. Ribozymes 

Similarly, ribozyme constructs can be transformed into a plant to cleave 
mRNA and down-regulate translation. 



TTT. A .3. Co-Suppression 

Another method of suppression is by introducing an exogenous copy of 
the gene to be suppressed. Introduction of expression cassettes in which a nucleic acid 
configured in the sense orientation with respect to the promoter has been shown to 



A detailed description of this method is described 

IH.A.4. Insertion of Sequences i nto the Gene to be Modulated 

Yet another means of suppressing gene expression is to insert a 

polynucleotide into the gene of interest to disrupt transcription or translation of the gene. 

Homologous recombination could be used to target a polynucleotide 

insert to a gene using the Cre-Lox system (A.C. Vergunst et al., Nucleic Acids Res. 

26:2729 (1998), A.C. Vergunst et al., Plant Mol. Biol. 28:393 (1998), H. Albert et al., 

Plant J. 2:649 (1995)). 

In addition, random insertion of polynucleotides into a host cell genome 
can also be used to disrupt the gene of interest. Azpiroz-Leehan et al., Trends in 
Genetics 12:152 (1997). In this method, screening for clones from a library containing 
random insertions is preferred for identifying those that have polynucleotides inserted 
into the gene of interest. Such screening can be performed using probes and/or primers 
described above based on sequences from the REF and SEQ Tables, fragments thereof, 
and substantially similar sequence thereto. The screening can also be performed by 
selecting clones or any transgenic plants having a desired phenotype. 

TTT A S Regulatory SegnenceModulation 

The SDFs described in the REF and SEQ Tables, and fragments thereof 
are examples of nucleotides of the invention that contain regulatory sequences that can 
be used to suppress or inactivate transcription and/or translation from a gene of interest 
as discussed in I.C.5. 
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prevent the accumulation of mRNA. 
above. 



IH.A.6. Genes Comprising Dom inant-Negative Mutations 
When suppression of production of the endogenous, native protein is 
desired it is often helpful to express a gene comprising a dominant negative mutation. 
Production of protein variants produced from genes comprising dominant negative 
mutations is a useful tool for research Genes comprising dominant negative mutations 
can produce a variant polypeptide which is capable of competing with the native 
polypeptide, but which does not produce the native result. Consequently, over expression 
of genes comprising these mutations can titrate out an undesired activity of the native 
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protein. For example, The product from a gene comprising a dominant negative 
mutation of a receptor can be used to constitutively activate or suppress a signal 
transduction cascade, allowing examination of the phenotype and thus the trait(s) 
controlled by that receptor and pathway. Alternatively, the protein arising from the 
gene comprising a dominant-negative mutation can be an inactive enzyme still capable 
of binding to the same substrate as the native protein and therefore competes with such 
native protein. 

Products from genes comprising dominant-negative mutations can also 
act upon the native protein itself to prevent activity. For example, the native protein may 
be active only as a homo-multimer or as one subunit of a hetero-multimer. Incorporation 
of an inactive subunit into the multimer with native subunit(s) can inhibit activity. 

Thus, gene function can be modulated in host cells of interest by 
insertion into these cells vector constructs comprising a gene comprising a dominant- 
negative mutation. 

III.B. Enhanced Expression 

Enhanced expression of a gene of interest in a host cell can be accomplished by 
either (1) insertion of an exogenous gene; or (2) promoter modulation. 

III.B.l. Insertion of an Exogenous Gene 

Insertion of an expression construct encoding an exogenous gene can 
boost the number of gene copies expressed in a host cell. 

Such expression constructs can comprise genes that either encode the 
native protein that is of interest or that encode a variant that exhibits enhanced activity as 
compared to the native protein. Such genes encoding proteins of interest can be 
constructed from the sequences from the REF and SEQ Tables, fragments thereof, and 
substantially similar sequence thereto. 

Such an exogenous gene can include either a constitutive promoter 
permitting expression in any cell in a host organism or a promoter that directs 
transcription only in particular cells or times during a host cell life cycle or in response 
to environmental stimuli. 



IILB.2. Regulatory Sequence M odulation 
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The SDFs of the REF and SEQ Tables, and fragments thereof, contain 
regulatory sequences that can be used to enhance expression of a gene of interest. For 
example, some of these sequences contain useful enhancer elements. In some cases, 
duplication of enhancer elements or insertion of exogenous enhancer elements will 
increase expression of a desired gene from a particular promoter. As other examples, all 
11 promoters require binding of a regulatory protein to be activated, while some 
promoters may need a protein that signals a promoter binding protein to expose a 
polymerase binding site. In either case, over-production of such proteins can be used to 
enhance expression of a gene of interest by increasing the activation time of the 
promoter. 

Such regulatory proteins are encoded by some of the sequences in the 
REF and SEQ Tables, fragments thereof, and substantially similar sequences thereto. 
Coding sequences for these proteins can be constructed as described above. 



IV. Gene Constructs and Vector Construction 

To use isolated SDFs of the present invention or a combination of them or parts 
and/or mutants and/or fusions of said SDFs in the above techniques, recombinant DNA 
vectors which comprise said SDFs and are suitable for transformation of cells, such as 
plant cells, are usually prepared. The SDF construct can be made using standard 
recombinant DNA techniques (Sambrook et al. 1989) and can be introduced to the 
species of interest by Agrobacterium-mediated transformation or by other means of 
transformation {e.g., particle gun bombardment) as referenced below. 

The vector backbone can be any of those typical in the art such as plasmids, 
viruses, artificial chromosomes, BACs, YACs and PACs and vectors of the sort 
described by 

(a) BAC: Shizuya et al., Proc. Natl. Acad. Sci. USA 89: 8794-8797 

(1992); 

Hamilton et al., Proc. Natl. Acad. Sci. USA 93: 9975-9979 (1996); 

(b) YAC: Burke et al., Science 236:806-812 (1987);. 

(c) PAC: Sternberg N. et al., Proc Natl Acad Sci USA. Jan;87(l): 103-7 

(1990); 

(d) Bacteria- Yeast Shuttle Vectors: Bradshaw et al., Nucl Acids Res 23 
4850-4856 (1995); 
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(e) Lambda Phage Vectors: Replacement Vector, e.g., 
Frischauf et al., J. Mol Biol 170: 827-842 (1983); or Insertion vector, e.g., 

Huynh et al., In: Glover NM (ed) DNA Cloning: A practical Approach, Vol.1 Oxford: 
IRL Press (1985); 

(f) T-DNA gene fusion vectors :Walden et al., Mol Cell Biol 1: 175-194 
(1990); and 

(g) Plasmid vectors: Sambrook et al., infra. 

Typically, a vector will comprise the exogenous gene, which in its turn 
comprises an SDF of the present invention to be introduced into the genome of a host 
cell, and which gene may be an antisense construct, a ribozyme construct chimeraplast, 
or a coding sequence with any desired transcriptional and/or translational regulatory 
sequences, such as promoters, UTRs, and 3 5 end termination sequences. Vectors of the 
invention can also include origins of replication, scaffold attachment regions (SARs), 
markers, homologous sequences, introns, etc. 

A DNA sequence coding for the desired polypeptide, for example a cDNA 
sequence encoding a full length protein, will preferably be combined with transcriptional 
and translational initiation regulatory sequences which will direct the transcription of the 
sequence from the gene in the intended tissues of the transformed plant. 

For example, for over-expression, a plant promoter fragment may be employed 
that will direct transcription of the gene in all tissues of a regenerated plant. 

Alternatively, the plant promoter may direct transcription of an SDF of the 
invention in a specific tissue (tissue-specific promoters) or may be otherwise under more 
precise environmental control (inducible promoters). 

If proper polypeptide productionis desired, a polyadenylation region at the 3 -end 
of the coding region is typically included. The polyadenylation region can be derived 
from the natural gene, from a variety of other plant genes, or from T-DNA. 

The vector comprising the sequences from genes or SDF or the invention may 
comprise a marker gene that confers a selectable phenotype on plant cells. The vector 
can include promoter and coding sequence, for instance. For example, the marker may 
encode biocide resistance, particularly antibiotic resistance, such as resistance to 
kanamycin, G418, bleomycin, hygromycin, or herbicide resistance, such as resistance to 
chlorosulfuron or phosphinotricin. 



IV.A. Coding Sequences 
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Generally, the sequence in the transformation vector and to be introduced 



into the genome of the host cell does not need to be absolutely identical to an SDF of the 
present invention. Also, it is not necessary for it to be full length, relative to either the 
primary transcription product or fully processed mRNA. Furthermore, the introduced 
sequence need not have the same intron or exon pattern as a native gene. Also, 
heterologous non-coding segments can be incorporated into the coding sequence without 
changing the desired amino acid sequence of the polypeptide to be produced. 

IV.B. Promoters 

As explained above, introducing an exogenous SDF from the same species or 
an orthologous SDF from another species can modulate the expression of a native 
gene corresponding to that SDF of interest. Such an SDF construct can be under the 
control of either a constitutive promoter or a highly regulated inducible promoter 
(e.g., a copper inducible promoter). The promoter of interest can initially be either 
endogenous or heterologous to the species in question. When re-introduced into the 
genome of said species, such promoter becomes exogenous to said species. Over- 
expression of an SDF transgene can lead to co-suppression of the homologous 
endogeneous sequence thereby creating some alterations in the phenotypes of the 
transformed species as demonstrated by similar analysis of the chalcone synthase 
gene (Napoli et ah, Plant Cell 2:279 (1990) and van der Krol et al., Plant Cell 2:291 
(1990)). If an SDF is found to encode a protein with desirable characteristics, its 
over-production can be controlled so that its accumulation can be manipulated in an 
organ- or tissue-specific manner utilizing a promoter having such specificity. 

Likewise, if the promoter of an SDF (or an SDF that includes a promoter) is 
found to be tissue-specific or developmentally regulated, such a promoter can be 
utilized to drive or facilitate the transcription of a specific gene of interest (e.g., seed 
storage protein or root-specific protein). Thus, the level of accumulation of a 
particular protein can be manipulated or its spatial localization in an organ- or tissue- 
specific manner can be altered. 

TV. C Signal Peptides 

SDFs of the present invention containing signal peptides are indicated in the 
REF and SEQ Tables. In some cases it may be desirable for the protein encoded by 
an introduced exogenous or orthologous SDF to be targeted (1) to a particular 
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organelle intracellular compartment, (2) to interact with a particular molecule such as 
a membrane molecule or (3) for secretion outside of the cell harboring the introduced 
SDF. This will be accomplished using a signal peptide. 

Signal peptides direct protein targeting, are involved in ligand-receptor 
interactions and act in cell to cell communication. Many proteins, especially soluble 
proteins, contain a signal peptide that targets the protein to one of several different 
intracellular compartments. In plants, these compartments include, but are not limited 
to, the endoplasmic reticulum (ER), mitochondria, plastids (such as chloroplasts), the 
vacuole, the Golgi apparatus, protein storage vessicles (PSV) and, in general, 
membranes. Some signal peptide sequences are conserved, such as the Asn-Pro-Ile- 
Arg amino acid motif found in the N-terminal propeptide signal that targets proteins 
to the vacuole (Marty (1999) The Plant Cell 11: 587-599). Other signal peptides do 
not have a consensus sequence per se, but are largely composed of hydrophobic 
amino acids, such as those signal peptides targeting proteins to the ER (Vitale and 
Denecke (1999) The Plant Cell 11: 615-628). Still others do not appear to contain 
either a consensus sequence or an identified common secondary sequence, for 
instance the chloroplast stromal targeting signal peptides (Keegstra and Cline (1999) 
The Plant Cell 11: 557-570). Furthermore, some targeting peptides are bipartite, 
directing proteins first to an organelle and then to a membrane within the organelle 
(e.g. within the thylakoid lumen of the chloroplast; see Keegstra and Cline (1999) The 
Plant Cell 11: 557-570). In addition to the diversity in sequence and secondary 
structure, placement of the signal peptide is also varied. Proteins destined for the 
vacuole, for example, have targeting signal peptides found at the N-terminus, at the C- 
terminus and at a surface location in mature, folded proteins. Signal peptides also 
serve as ligands for some receptors. 

These characteristics of signal proteins can be used to more tightly control the 
phenotypic expression of introduced SDFs. In particular, associating the appropriate 
signal sequence with a specific SDF can allow sequestering of the protein in specific 
organelles (plastids, as an example), secretion outside of the cell, targeting interaction 
with particular receptors, etc. Hence, the inclusion of signal proteins in constructs 
involving the SDFs of the invention increases the range of manipulation of SDF 
phenotypic expression. The nucleotide sequence of the signal peptide can be isolated 
from characterized genes using common molecular biological techniques or can be 
synthesized in vitro. 
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In addition, the native signal peptide sequences, both amino acid and nucleotide, 
described in the REF and SEQ Tables can be used to modulate polypeptide transport. 
Further variants of the native signal peptides described in the REF and SEQ Tables are 
contemplated. Insertions, deletions, or substitutions can be made. Such variants will 
retain at least one of the functions of the native signal peptide as well as exhibiting some 
degree of sequence identity to the native sequence. 

Also, fragments of the signal peptides of the invention are useful and can be 
fused with other signal peptides of interest to modulate transport of a polypeptide. 



V. Transformation Techniques 

A wide range of techniques for inserting exogenous polynucleotides are known 
for a number of host cells, including, without limitation, bacterial, yeast, mammalian, 
insect and plant cells. 

Techniques for transforming a wide variety of higher plant species are well 
known and described in the technical and scientific literature. See, e.g. Weising et al., 
Ann. Rev. Genet 22:421 (1988); and Christou, Euphytica, v. 85, n.l-3:13-27, (1995). 

DNA constructs of the invention may be introduced into the genome of the 
desired plant host by a variety of conventional techniques. For example, the DNA 
construct may be introduced directly into the genomic DNA of the plant cell using 
techniques such as electroporation and microinjection of plant cell protoplasts, or the 
DNA constructs can be introduced directly to plant tissue using ballistic methods, such 
as DNA particle bombardment. Alternatively, the DNA constructs may be combined 
with suitable T-DNA flanking regions and introduced into a conventional 
Agrobacterium tumefaciens host vector. The virulence functions of the Agrobacterium 
tumefaciens host will direct the insertion of the construct and adjacent marker into the 
plant cell DNA when the cell is infected by the bacteria (McCormac et al. 3 Mol. 
Biotechnol. 8:199 (1997); Hamilton, Gene 200:107 (1997)); Salomon et al. EMBOJ. 
3:141 (1984); Herrera-Estrella et al. EMBOJ. 2:987 (1983). 

Microinjection techniques are known in the art and well described in the 
scientific and patent literature. The introduction of DNA constructs using polyethylene 
glycol precipitation is described in Paszkowski et al. EMBOJ. 3:2717 (1984). 
Electroporation techniques are described in Fromm et al. Proc. Natl Acad. Sci. USA 
82:5824 (1985). Ballistic transformation techniques are described in Klein et al. Nature 
322:773 (1987). Agrobacterium tumefaciens-mediattd transformation techniques, 
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including disarming and use of binary or co-integrate vectors, are well described in the 
scientific literature. See, for example Hamilton, CM, Gene 200:107 (1997); Muller et al. 
Mol. Gen. Genet. 207:171 (1987); Komari et al. Plant J. 10:165 (1996); Venkateswarlu 
et al. Biotechnology 9:1103 (1991) and Gleave,AP., Plant Mol. Biol 20:1203 (1992); 
Graves and Goldman, Plant Mol Biol 2:34 (1986) and Gould et al., Plant Physiology 
95:426 (1991). 

Transformed plant cells which are derived by any of the above transformation 
techniques can be cultured to regenerate a whole plant that possesses the transformed 
genotype and thus the desired phenotype such as seedlessness. Such regeneration 
techniques rely on manipulation of certain phytohormones in a tissue culture growth 
medium, typically relying on a biocide and/or herbicide marker which has been 
introduced together with the desired nucleotide sequences. Plant regeneration from 
cultured protoplasts is described in Evans et al., Protoplasts Isolation and Culture in 
Handbook of Plant Cell Culture," pp. 124-176, MacMillan Publishing Company, New 
York, 1983; and Binding, Regeneration of Plants, Plant Protoplasts, pp. 21-73, CRC 
Press, Boca Raton, 1988. Regeneration can also be obtained from plant callus, explants, 
organs, or parts thereof. Such regeneration techniques are described generally in Klee et 
dX.Ann. Rev. of Plant Phys. 38:467 (1987). Regeneration of monocots (rice) is described 
by Hosoyama et al. (Bioscl Biotechnol. Biochem. 58:1500 (1994)) and by Ghosh et al. 
(J. Biotechnol 32:1 (1994)). The nucleic acids of the invention can be used to confer 
desired traits on essentially any plant. 

Thus, the invention has use over a broad range of plants, including species from 
the genera Anacardium, Arachis, Asparagus, Atropa, Avena, Brassica, Citrus, Citrullus, 
Capsicum, Carthamus, Cocos, Coffea, Cucumis, Cucurbita, Daucus, Elaeis, Fragaria, 
Glycine, Gossypium, Helianthus, Heterocallis, Hordeum, Hyoscyamus, Lactuca, Linum, 
Lolium,Lupinus, Lycopersicon, Malus, Manihot, Majorana, Medicago, Nicotiana, Olea, 
Oryza, Panieum, Pannesetum, Persea, Phaseolus, Pistachia, Pisum, Pyrus, Prunus, 
Raphanus, Ricinus, Secale, Senecio, Sinapis, Solanum, Sorghum, Theobromus, 
Trigonella, Triticum, Vicia, Vitis, Vigna, and, Zea. 

One of skill will recognize that after the expression cassette is stably 
incorporated in transgenic plants and confirmed to be operable, it can be introduced 
into other plants by sexual crossing. Any of a number of standard breeding 
techniques can be used, depending upon the species to be crossed. 
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The particular sequences of SDFs identified are provided in the attached the 
REF and SEQ Tables. One of ordinary skill in the art, having this data, can obtain 
cloned DNA fragments, synthetic DNA fragments or polypeptides constituting 
desired sequences by recombinant methodology known in the art or described herein. 

EXAMPLES 

The invention is illustrated by way of the following examples. The invention 
is not limited by these examples as the scope of the invention is defined solely by the 
claims following. 

EXAMPLE 1: cDNA PREPARATION 

A number of the nucleotide sequences disclosed in the REF and SEQ Tables 
herein as representative of the SDFs of the invention can be obtained by sequencing 
genomic DNA (gDNA) and/or cDNA from corn plants grown from HYBRID SEED # 
35A19, purchased from Pioneer Hi-Bred International, Inc., Supply Management, P.O. 
Box 256, Johnston, Iowa 50131-0256. 

A number of the nucleotide sequences disclosed in the REF and SEQ Tables 
herein as representative of the SDFs of the invention can also be obtained by 
sequencing genomic DNA from Arabidopsis thaliana, Wassilewskija ecotype or by 
sequencing cDNA obtained from mRNA from such plants as described below. This is 
a true breeding strain. Seeds of the plant are available from the Arabidopsis 
Biological Resource Center at the Ohio State University, under the accession number 
CS2360. Seeds of this plant were deposited under the terms and conditions of the 
Budapest Treaty at the American Type Culture Collection, Manassas, VA on August 
31, 1999, and were assigned ATCC No. PTA-595. 

Other methods for cloning full-length cDNA are described, for example, by 
Seki et aL, Plant Journal 15:707-720 (1998) High-efficiency cloning of Arabidopsis 
full-length cDNA by biotinylated Cap trapper"; Maruyama et al., Gene 138:171 
(1994) Oligo-capping a simple method to replace the cap structure of eukaryotic 
mRNAs with oligoribonucleotides"; and WO 96/34981. 

Tissues were, or each organ was, individually pulverized and frozen in liquid 
nitrogen. Next, the samples were homogenized in the presence of detergents and then 
centrifuged. The debris and nuclei were removed from the sample and more 
detergents were added to the sample. The sample was centrifuged and the debris was 
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removed. Then the sample was applied to a 2M sucrose cushion to isolate polysomes. 
The RNA was isolated by treatment with detergents and proteinase K followed by 
ethanol precipitation and centrifugation. The polysomal RNA from the different 
tissues was pooled according to the following mass ratios: 15/15/1 for male 
inflorescences, female inflorescences and root, respectively. The pooled material was 
then used for cDNA synthesis by the methods described below. 



clones with sequences presented in the REF and SEQ Tables was poly(A)-containing 
polysomal mRNAs from inflorescences and root tissues of corn plants grown from 
HYBRID SEED # 35A19. Male inflorescences and female (pre-and post- 
fertilization) inflorescences were isolated at various stages of development. Selection 
for poly(A) containing polysomal RNA was done using oligo d(T) cellulose columns, 
as described by Cox and Goldberg, Plant Molecular Biology: A Practical Approach", 
pp. 1-35, Shaw ed., c. 1988 by IRL, Oxford. The quality and the integrity of the 
polyA+ RNAs were evaluated. 



cDNA clones with sequences presented in the REF and SEQ Tables was polysomal 
RNA isolated from the top-most inflorescence tissues of Arabidopsis thaliana 
Wassilewskija (Ws.) and from roots of Arabidopsis thaliana Landsberg erecta (L. 
er.), also obtained from the Arabidopsis Biological Resource Center. Nine parts 
inflorescence to every part root was used, as measured by wet mass. Tissue was 
pulverized and exposed to liquid nitrogen. Next, the sample was homogenized in the 
presence of detergents and then centrifuged. The debris and nuclei were removed 
from the sample and more detergents were added to the sample. The sample was 
centrifuged and the debris was removed and the sample was applied to a 2M sucrose 
cushion to isolate polysomal RNA. Cox et al., Plant Molecular Biology: A Practical 
Approach", pp. 1-35, Shaw ed., c. 1988 by IRL, Oxford. The polysomal RNA was 
used for cDNA synthesis by the methods described below. Polysomal mRNA was 
then isolated as described above for corn cDNA. The quality of the RNA was 
assessed electrophoretically. 

Following preparation of the mRNAs from various tissues as described above, 
selection of mRNA with intact 5' ends and specific attachment of an oligonucleotide tag 
to the 5' end of such mRNA was performed using either a chemical or enzymatic 



Starting material for cDNA synthesis for the exemplary corn cDNA 



Starting material for cDNA synthesis for the exemplary Arabidopsis 
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approach. Both techniques take advantage of the presence of the cap" structure, which 
characterizes the 5' end of most intact mRNAs and which comprises a guanosine 
generally methylated once, at the 7 position. 

The chemical modification approach involves the optional elimination of the 2', 
3'-cis diol of the 3' terminal ribose, the oxidation of the 2', 3'-cis diol of the ribose 
linked to the cap of the 5' ends of the mRNAs into a dialdehyde, and the coupling of the 
such obtained dialdehyde to a derivatized oligonucleotide tag. Further detail regarding 
the chemical approaches for obtaining mRNAs having intact 5' ends are disclosed in 
International Application No. W096/34981 published November 7, 1996. 

The enzymatic approach for ligating the oligonucleotide tag to the intact 5' ends 
of mRNAs involves the removal of the phosphate groups present on the 5' ends of 
uncapped incomplete mRNAs, the subsequent decapping of mRNAs having intact 5' 
ends and the ligation of the phosphate present at the 5' end of the decapped mRNA to an 
oligonucleotide tag. Further detail regarding the enzymatic approaches for obtaining 
mRNAs having intact 5' ends are disclosed in Dumas Milne Edwards J.B. (Doctoral 
Thesis of Paris VI University, Le clonage des ADNc complets: difficult^ et perspectives 
nouvelles. Apports pour IStude de la regulation de l'expression de la tryptophane 
hydroxylase de rat, 20 Dec. 1993), EPO 625572 and Kato et al, Gene 15J}:243-250 
(1994). 

In both the chemical and the enzymatic approach, the oligonucleotide tag has a 
restriction enzyme site (e.g. an EcoRI site) therein to facilitate later cloning procedures. 
Following attachment of the oligonucleotide tag to the mRNA, the integrity of the 
mRNA is examined by performing a Northern blot using a probe complementary to the 
oligonucleotide tag. 

For the mRNAs joined to oligonucleotide tags using either the chemical or the 
enzymatic method, first strand cDNA synthesis is performed using an oligo-dT primer 
with reverse transcriptase. This oligo-dT primer can contain an internal tag of at least 4 
nucleotides, which can be different from one mRNA preparation to another. Methylated 
dCTP is used for cDNA first strand synthesis to protect the internal EcoRI sites from 
digestion during subsequent steps. The first strand cDNA is precipitated using 
isopropanol after removal of RNA by alkaline hydrolysis to eliminate residual primers. 

Second strand cDNA synthesis is conducted using a DNA polymerase, such as 
Klenow fragment and a primer corresponding to the 5' end of the ligated 
oligonucleotide. The primer is typically 20-25 bases in length. Methylated dCTP is used 
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for second strand synthesis in order to protect internal EcoRI sites in the cDNA from 
digestion during the cloning process. 

Following second strand synthesis, the full-length cDNAs are cloned into a 
phagemid vector, such as pBlueScript™ (Stratagene). The ends of the full-length 
cDNAs are blunted with T4 DNA polymerase (Biolabs) and the cDNA is digested with 
EcoRI. Since methylated dCTP is used during cDNA synthesis, the EcoRI site present 
in the tag is the only hemi-methylated site; hence the only site susceptible to EcoRI 
digestion. In some instances, to facilitate subcloning, an Hind III adapter is added to the 
3' end of full-length cDNAs. 

The full-length cDNAs are then size fractionated using either exclusion 
chromatography (AcA, Biosepra) or electrophoretic separation which yields 3 to 6 
different fractions. The full-length cDNAs are then directionally cloned either into 
pBlueScript™ using either the EcoRI and Smal restriction sites or, when the Hind III 
adapter is present in the full-length cDNAs, the EcoRI and Hind III restriction sites. The 
ligation mixture is transformed, preferably by electroporation, into bacteria, which are 
then propagated under appropriate antibiotic selection. 

Clones containing the oligonucleotide tag attached to full-length cDNAs are 

selected as follows. 

The plasmid cDNA libraries made as described above are purified (e.g. by a 
column available from Qiagen). A positive selection of the tagged clones is performed 
as follows. Briefly, in this selection procedure, the plasmid DNA is converted to single 
stranded DNA using phage Fl gene II endonuclease in combination with an exonuclease 
(Chang et al., Gene 122:95 (1993)) such as exonuclease III or T7 gene 6 exonuclease. 
The resulting single stranded DNA is then purified using paramagnetic beads as 
described by Fry et al., Biotechniques 13.: 124 (1992). Here the single stranded DNA is 
hybridized with a biotinylated oligonucleotide having a sequence corresponding to the 3' 
end of the oligonucleotide tag. Preferably, the primer has a length of 20-25 bases. 
Clones including a sequence complementary to the biotinylated oligonucleotide are 
selected by incubation with streptavidin coated magnetic beads followed by magnetic 
capture. After capture of the positive clones, the plasmid DNA is released from the 
magnetic beads and converted into double stranded DNA using a DNA polymerase such 
as ThermoSequenase™ (obtained from Amersham Pharmacia Biotech). Alternatively, 
protocols such as the Gene Trapper™ kit (Gibco BRL) can be used. The double stranded 
DNA is then transformed, preferably by electroporation, into bacteria. The percentage 
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of positive clones having the 5' tag oligonucleotide is typically estimated to be between 
90 and 98% from dot blot analysis. 

Following transformation, the libraries are ordered in microliter plates and 
sequenced. The Arabidopsis library was deposited at the American Type Culture 
Collection on January 7, 2000 as E-coli liba 010600" under the accession number 
PTA-1161 . 

EXAMPLE 2: SOUTHERN HYBRIDIZATIONS 

The SDFs of the invention can be used in Southern hybridizations as described 
above. The following describes extraction of DNA from nuclei of plant cells, 
digestion of the nuclear DNA and separation by length, transfer of the separated 
fragments to membranes, preparation of probes for hybridization, hybridization and 
detection of the hybridized probe. 

The procedures described herein can be used to isolate related polynucleotides 
or for diagnostic purposes. Moderate stringency hybridization conditions, as defined 
above, are described in the present example. These conditions result in detection of 
hybridization between sequences having at least 70% sequence identity. As described 
above, the hybridization and wash conditions can be changed to reflect the desired 
percenatge of sequence identity between probe and target sequences that can be 
detected. 

In the following procedure, a probe for hybridization is produced from two 
PCR reactions using two primers from genomic sequence of Arabidopsis thaliana. 
As described above, the particular template for generating the probe can be any 
desired template. 

The first PCR product is assessed to validate the size of the primer to assure it 
is of the expected size. Then the product of the first PCR is used as a template, with 
the same pair of primers used in the first PCR, in a second PCR that produces a 
labeled product used as the probe. 

Fragments detected by hybridization, or other bands of interest, can be isolated 
from gels used to separate genomic DNA fragments by known methods for further 
purification and/or characterization. 

Buffers for nuclear DNA extraction 

1. 10XHB 



1000 ml 
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40 mM spermidine 


10.2 g 


Spermine (Sigma S-2876) and spermidine (Sigma 
S-2501) 


10 mM spermine 


3.5 g 


Stabilize chromatin and the nuclear membrane 


0.1 M EDTA 
(disodium) 


37.2 g 


EDTA inhibits nuclease 

• 


0.1 M Tris 


12.1 g 


Buffer 


0.8 M KC1 


59.6 g 


Adjusts ionic strength for stability of nuclei 



Adjust pH to 9.5 with 10 N NaOH. It appears that there is a nuclease present 
in leaves. Use of pH 9.5 appears to inactivate this nuclease. 

2. 2 M sucrose (684 g per 1000 ml) 

Heat about half the final volume of water to about 50°C. Add the sucrose 
slowly then bring the mixture to close to final volume; stir constantly until it 
has dissolved. Bring the solution to volume. 

3. Sarkosyl solution (lyses nuclear membranes) 

1000 ml 

N-lauroyl sarcosine (Sarkosyl) 20.0 g 

0.1 M Tris 12.1 g 

0.04 M EDTA (Disodium) 14.9 g 

Adjust the pH to 9.5 after all the components are dissolved and bring up to the 
proper volume. 

4. 20% Triton X-100 
80 ml Triton X-100 

320 ml lxHB (w/o (3-ME and PMSF) 
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Prepare in advance; Triton takes some time to dissolve 



A. Procedure 

1. Prepare IX H" buffer (keep ice-cold during use) 

1000 ml 

10X HB 100 ml 

2 M sucrose 250 ml a non-ionic osmoticum 

Water 634 ml 



Added just before use: 



100 mM PMSF* 10 ml a protease inhibitor; 

protects nuclear membrane 
proteins 

B-mercaptoethanol 1 ml inactivates nuclease by 

reducing disulfide bonds 



*100 mM PMSF 

(phenyl methyl sulfonyl fluoride, Sigma P-7626) 
(add 0.0875 g to 5 ml 100% ethanol) 



2. Homogenize the tissue in a blender (use 300-400 ml of lxHB per blender). Be 
sure that you use 5-10 ml of HB buffer per gram of tissue. Blenders generate 
heat so be sure to keep the homogenate cold. It is necessary to put the 
blenders in ice periodically. 



3. Add the 20% Triton X-100 (25 ml per liter of homogenate) and gently stir on 
ice for 20 min. This lyses plastid, but not nuclear, membranes. 



4. Filter the tissue suspension through several nylon filters into an ice-cold 

beaker. The first filtration is through a 250-micron membrane; the second is 
through an 85-micron membrane; the third is through a 50-micron membrane; 
and the fourth is through a 20-micron membrane. Use a large funnel to hold 
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the filters. Filtration can be sped up by gently squeezing the liquid through the 
filters. 

5. Centrifuge the filtrate at 1200 x g for 20 min. at 4°C to pellet the nuclei. 

6. Discard the dark green supernatant. The pellet will have several layers to it. 
One is starch; it is white and gritty. The nuclei are gray and soft. In the early 
steps, there may be a dark green and somewhat viscous layer of chloroplasts. 

Wash the pellets in about 25 ml cold H buffer (with Triton X-100) and 
resuspend by swirling gently and pipetting. After the pellets are resuspended. 

Pellet the nuclei again at 1200 - 1300 x g. Discard the supernatant. 

Repeat the wash 3-4 times until the supernatant has changed from a dark green 
to a pale green. This usually happens after 3 or 4 resuspensions. At this point, 
the pellet is typically grayish white and very slippery. The Triton X-100 in 
these repeated steps helps to destroy the chloroplasts and mitochondria that 
contaminate the prep. 

Resuspend the nuclei for a final time in a total of 15 ml of H buffer and 
transfer the suspension to a sterile 125 ml Erlenmeyer flask. 

7. Add 15 ml, dropwise, cold 2% Sarkosyl, 0.1 M Tris, 0.04 M EDTA solution 
(pH 9.5) while swirling gently. This lyses the nuclei. The solution will 
become very viscous. 

8. Add 30 grams of CsCl and gently swirl at room temperature until the CsCl is 
in solution. The mixture will be gray, white and viscous. 



9. 



Centrifuge the solution at 11,400 x g at 4°C for at least 30 min. The longer 
this spin is, the firmer the protein pellicle. 
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10. The result is typically a clear green supernatant over a white pellet, and 

(perhaps) under a protein pellicle. Carefully remove the solution under the 
protein pellicle and above the pellet. Determine the density of the solution by 
weighing 1 ml of solution and add CsCl if necessary to bring to 1.57 g/ml. 
The solution contains dissolved solids (sucrose etc) and the refractive index 
alone will not be an accurate guide to CsCl concentration. 



11. Add 20 jLtl of 10 mg/ml EtBr per ml of solution. 



12. Centrifuge at 184,000 x g for 16 to 20 hours in a fixed-angle rotor. 

13. Remove the dark red supernatant that is at the top of the tube with a plastic 
transfer pipette and discard. Carefully remove the DNA band with another 
transfer pipette. The DNA band is usually visible in room light; otherwise, 
use a long wave UV light to locate the band. 

14. Extract the ethidium bromide with isopropanol saturated with water and salt. 
Once the solution is clear, extract at least two more times to ensure that all of 
the EtBr is gone. Be very gentle, as it is very easy to shear the DNA at this 
step. This extraction may take a while because the DNA solution tends to be 
very viscous. If the solution is too viscous, dilute it with TE. 

15. Dialyze the DNA for at least two days against several changes (at least three 
times) of TE (10 mM Tris, ImM EDTA, pH 8) to remove the cesium chloride. 

16. Remove the dialyzed DNA from the tubing. If the dialyzed DNA solution 
contains a lot of debris, centrifuge the DNA solution at least at 2500 x g for 10 
min. and carefully transfer the clear supernatant to a new tube. Read the A260 
concentration of the DNA. 



17. Assess the quality of the DNA by agarose gel electrophoresis (1% agarose gel) 
of the DNA. Load 50 ng and 100 ng (based on the OD reading) and compare 
it with known and good quality DNA. Undigested lambda DNA and a 
lambda-Hindlll-digested DNA are good molecular weight makers. 
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Protocol for Digestion of Genomic DNA 

Protocol : 

1. The relative amounts of DNA for different crop plants that provide 
approximately a balanced number of genome equivalent is given in Table 3. 
Note that due to the size of the wheat genome, wheat DNA will be 
underrepresented. Lambda DNA provides a useful control for complete 
digestion. 

2. Precipitate the DNA by adding 3 volumes of 100% ethanol. Incubate at -20 C 
for at least two hours. Yeast DNA can be purchased and made up at the 
necessary concentration, therefore no precipitation is necessary for yeast 
DNA. 

3. Centrifuge the solution at 11,400 x g for 20 min. Decant the ethanol carefully 
(be careful not to disturb the pellet). Be sure that the residual ethanol is 
completely removed either by vacuum desiccation or by carefully wiping the 
sides of the tubes with a clean tissue. 

4. Resuspend the pellet in an appropriate volume of water. Be sure the pellet is 
fully resuspended before proceeding to the next step. This may take about 30 
min. 

5. Add the appropriate volume of 10X reaction buffer provided by the 
manufacturer of the restriction enzyme to the resuspended DNA followed by 
the appropriate volume of enzymes. Be sure to mix it properly by slowly 
swirling the tubes. 

6. Set-up the lambda digestion-control for each DNA that you are digesting. 



7. 



Incubate both the experimental and lambda digests overnight at 37°C. Spin 
down condensation in a microfuge before proceeding. 
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8. After digestion, add 2 jal of loading dye (typically 0.25% bromophenol blue, 
0.25% xylene cyanol in 15% Ficoll or 30% glycerol) to the lambda-control 
digests and load in 1% TPE-agarose gel (TPE is 90 raM Tris-phosphate, 2 raM 
EDTA, pH 8). If the lambda DNA in the lambda control digests are 
completely digested, proceed with the precipitation of the genomic DNA in 
the digests. 



9. Precipitate the digested DNA by adding 3 volumes of 100% ethanol and 
incubating in -20°C for at least 2 hours (preferably overnight). 

EXCEPTION: Arabidopsis and yeast DNA are digested in an appropriate 
volume; they don't have to be precipitated. 

10. Resuspend the DNA in an appropriate volume of TE (e.g., 22 ^il x 50 blots = 
1100 \x\) and an appropriate volume of 10X loading dye (e.g., 2.4 \i\ x 50 blots 
= 120 jil). Be careful in pipetting the loading dye - it is viscous. Be sure you 
are pipetting the correct volume. 

Table 3 



Some guide points in digesting genomic DNA. 



Species 


Genome 
Size 


Size Relative to 
Arabidopsis 


Genome 
Equivalent to 2 
fig Arabidopsis 
DNA 


Amount 
of DNA 
per blot 


Arabidopsis 


120 Mb 


IX 


IX 




Brassica 


1,100 Mb 


9.2X 


0.54X 


10 ng 


Corn 


2,800 Mb 


23.3X 


0.43X 


20 ug 


Cotton 


2,300 Mb 


19.2X 


0.52X 


20 \ig 


Oat 


11,300 Mb 


94X 


0.11X 


20 ng 


Rice 


400 Mb 


3.3X 


0.75X 


5 l^g 


Soybean 


1,100 Mb 


9.2X 


0.54X 


10 ^ 


Sugarbeet 


758 Mb 


6.3X 


0.8X 


10 M g 
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Sweetclover 


1,100 Mb 


9.2X 


0.54X 


10 


Wheat 


16,000 Mb 


133X 


0.08X 


20 ng 


Yeast 


15 Mb 


0.12X 


IX 


0.25 ^g 



Protocol for Southern Blot Analysis 

The digested DNA samples are electrophoresed in 1% agarose gels in lx TPE 
buffer. Low voltage; overnight separations are preferred. The gels are stained with 
EtBr and photographed. 

1. For blotting the gels, first incubate the gel in 0.25 N HC1 (with gentle shaking) 
for about 15 min. 

2. Then briefly rinse with water. The DNA is denatured by 2 incubations. 
Incubate (with shaking) in 0.5 M NaOH in 1.5 M NaCl for 15 min. 

3. The gel is then briefly rinsed in water and neutralized by incubating twice 
(with shaking) in 1.5 M Tris pH 7.5 in 1.5 M NaCl for 15 min. 

4. A nylon membrane is prepared by soaking it in water for at least 5 min, then in 
6X SSC for at least 15 min. before use. (20x SSC is 175.3 g NaCl, 88.2 g 
sodium citrate per liter, adjusted to pH 7.0.) 

5. The nylon membrane is placed on top of the gel and all bubbles in between are 
removed. The DNA is blotted from the gel to the membrane using an 
absorbent medium, such as paper toweling and 6x SCC buffer. After the 
transfer, the membrane may be lightly brushed with a gloved hand to remove 
any agarose sticking to the surface. 

6. The DNA is then fixed to the membrane by UV crosslinking and baking at 
80°C. The membrane is stored at 4°C until use. 

B. Protocol for PCR Amplification of Genomic Fragments in Arabidopsis 



Amplification procedures : 
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1. Mix the following in a 0.20 ml PCR tube or 96-well PCR plate: 



Volume 


Stock 


Final Amount or Cone. 


0.5 nl 


~ 10 ng/|al genomic DNA 1 


5 ng 


2.5 jxl 


10X PCR buffer 


20 mM Tris, 50 mM KC1 


0.75 ^1 


50 mM MgCl 2 


1.5 mM 


1 jxl 


10 pmol/|il Primer 1 (Forward) 


10 pmol 


1 jxl 


10 pmol/jal Primer 2 (Reverse) 


10 pmol 


0.5 nl 


5 mM dNTPs 


0.1 mM 


0.1 \il 


5 units/nl Platinum Taq™ (Life 
Technologies, Gaithersburg, MD) 
DNA Polymerase 


1 units 


(to 25 jal) 


Water 





2. The template DNA is amplified using a Perkin Elmer 9700 PCR 
machine: 



1) 94°C for 10 min. followed by 



2) 

5 cycles: 


2) 

5 cycles: 


4) 

25 cycles: 


94 °C- 30 sec 
62 °C - 30 sec 
72 °C - 3 min 


94 °C - 30 sec 
58 °C - 30 sec 
72 °C - 3 min 


94 °C - 30 sec 
53 °C- 30 sec 
72 °C- 3 min 



1 Arabidopsis DNA is used in the present experiment, but the procedure is a general one. 
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5) 72 C for 7 min. Then the reactions are stopped by chilling to 4°C. 
The procedure can be adapted to a multi-well format if necessary. 
Quantification and Dilution of PCR Products: 

1. The product of the PCR is analyzed by electrophoresis in a 1% agarose gel. A 
linearized plasmid DNA can be used as a quantification standard (usually at 
50, 100, 200, and 400 ng). These will be used as references to approximate 
the amount of PCR products. Hindlll-digested Lambda DNA is useful as a 
molecular weight marker. The gel can be run fairly quickly; e.g., at 100 volts. 
The standard gel is examined to determine that the size of the PCR products is 
consistent with the expected size and if there are significant extra bands or 
smeary products in the PCR reactions. 

2. The amounts of PCR products can be estimated on the basis of the plasmid 
standard. 

3. For the small number of reactions that produce extraneous bands, a small 
amount of DNA from bands with the correct size can be isolated by dipping a 
sterile 10-jal tip into the band while viewing though a UV Transilluminator. 
The small amount of agarose gel (with the DNA fragment) is used in the 
labeling reaction. 

C. Protocol for PCR DIG-Labeling of DNA 

Solutions : 

Reagents in PCR reactions (diluted PCR products, 10X PCR Buffer, 50 raM 
MgCl 2 , 5 U/|al Platinum Taq Polymerase, and the primers) 

10X dNTP + DIG-ll-dUTP [1:5]: (2 mM dATP, 2 mM dCTP, 2 mM dGTP, 
1.65 mM dTTP, 0.35 mM DIG-ll-dUTP) 
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10X dNTP + DIG-ll-dUTP [1:10]: (2 mM dATP, 2 mM dCTP, 2 mM dGTP, 
1.81 mM dTTP, 0.19 mM DIG-ll-dUTP) 

10X dNTP + DIG-ll-dUTP [1:15]: (2 mM dATP, 2 mM dCTP ? 2 mM dGTP, 
1.875 mM dTTP, 0.125 mM DIG-ll-dUTP) 

TE buffer (10 mM Tris, 1 mM EDTA, pH 8) 

Maleate buffer: In 700 ml of deionized distilled water, dissolve 11.61 g 
maleic acid and 8.77 g NaCl. Add NaOH to adjust the pH to 7.5. Bring the 
volume to 1 L. Stir for 15 min. and sterilize. 

10% blocking solution: In 80 ml deionized distilled water, dissolve 1.16g 
maleic acid. Next, add NaOH to adjust the pH to 7.5. Add 10 g of the 
blocking reagent powder (Boehringer Mannheim, Indianapolis, IN, Cat. no. 
1096176). Heat to 60°C while stirring to dissolve the powder. Adjust the 
volume to 100 ml with water. Stir and sterilize. 

1% blocking solution: Dilute the 10% stock to 1% using the maleate buffer. 



Buffer 3 (100 mM Tris, 100 mM NaCl, 50 mM MgCl 2 , pH9.5). Prepared 
from autoclaved solutions of 1M Tris pH 9.5, 5 M NaCl, and 1 M MgCl 2 in 
autoclaved distilled water. 



Attorney No . ^B|0-1243P 



• 



98 



Procedure : 

1. PCR reactions are performed in 25 ul volumes containing: 



PCR buffer 
MgCl 2 

10X dNTP + DIG-ll-dUTP 

Platinum Taq™ Polymerase 
10 pg probe DNA 
10 pmol primer 1 



IX 

1.5 mM 

IX (please see the note 
below) 

1 unit 



Note: 



IPX dNTP + DIG-1 1 -dl JTP (1 :5) 



10X dNTP + DIG-ll-dUTP (1:10) 
10X dNTP + DIG-ll-dUTP (1:15) 



Use for : 
< 1 kb 



1 kb to 1.8 kb 
> 1.8 kb 



The PCR reaction uses the following amplification cycles: 
1) 94°C for 10 min. 



2) 




2) 




4) 


5 cycles: 




5 cycles: 




25 cycles: 


95°C - 


30 sec 


95°C - 


30 sec 


95°C -30 sec 


61°C - 


1 min 


59°C - 


1 min 


51°C - 1 min 


73°C - 


5 min 


75°C - 


5 min 


73°C - 5 min 



5) 72°C for 8 min. The reactions are terminated by chilling to 4°C (hold). 

3. The products are analyzed by electrophoresis- in a 1% agarose gel, comparing 
to an aliquot of the unlabelled probe starting material. 



4. The amount of DIG-labeled probe is determined as follows: 
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Make serial dilutions of the diluted control DNA in dilution buffer (TE: 10 
mM Tris and 1 raM EDTA, pH 8) as shown in the following table: 



DIG-labeled control 
DNA starting cone. 


Stepwise Dilution 


Final Cone. (Dilution 
Name) 


5 ng/pl 


1 pi in 49 pi TE 


100 pg/pl (A) 


100 pg/pl (A) 


25 pi in 25 pi TE 


50 pg/pl (B) 


50 pg/pl (B) 


25 pi in 25 pi TE 


25 pg/pl (C) 


25 pg/pl (C) 


20 pi in 30 pi TE 


10 pg/pl (D) 



Serial deletions of a DIG-labeled standard DNA ranging from 100 pg 
to 10 pg are spotted onto a positively charged nylon membrane, 
marking the membrane lightly with a pencil to identify each dilution. 



b. Serial dilutions (e.g., 1:50, 1:2500, 1:10,000) of the newly labeled 
DNA probe are spotted. 

c. The membrane is fixed by UV crosslinking. 

d. The membrane is wetted with a small amount of maleate buffer and 
then incubated in 1% blocking solution for 15 min at room temp. 

e. The labeled DNA is then detected using alkaline phosphatase 
conjugated anti-DIG antibody (Boehringer Mannheim, Indianapolis, 
IN, cat. no. 1093274) and an NBT substrate according to the 
manufacture's instruction. 



f. 



Spot intensities of the control and experimental dilutions are then 
compared to estimate the concentration of the PCR-DIG-labeled probe. 
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D. Prehybridization and Hybridization of Southern Blots 

Solutions : 

100% Formamide purchased from Gibco 



20X SSC (IX = 0.15 M NaCl, 0.015 M Na 3 citrate) 

per L: 175 g NaCI 

87.5gNa 3 citrate*2H 2 0 



20% Sarkosyl (N-lauroyl-sarcosine) 
20% SDS (sodium dodecyl sulphate) 

i0% Blocking Reagent: In 80 ml deionized distilled water, dissolve 1.16 g 
maleic acid. Next, add NaOH to adjust the pH to 7.5. Add 10 g of the 
blocking reagent powder. Heat to 60°C while stirring to dissolve the 
powder. Adjust the volume to 100 ml with water. Stir and sterilize. 



Prehybridization Mix: 



Final 

Concentration 


Components 


Volume 
(per 100 ml) 


Stock 


50% 


Formamide 


50 ml 


100% 


5X 


SSC 


25 ml 


20X 


0.1% 


Sarkosyl 


0.5 ml 


20% 


0.02% 


SDS 


0.1 ml 


20% 


2% 


Blocking Reagent 


20 ml 


10% 




Water 


4.4 ml 





General Procedures : 

1. Place the blot in a heat-sealable plastic bag and add an appropriate volume of 
prehybridization solution (30 ml/1 00cm 2 ) at room temperature. Seal the bag 
with a heat sealer, avoiding bubbles as much as possible. Lay down the bags 
in a large plastic tray (one tray can accommodate at least 4—5 bags). Ensure 
that the bags are lying flat in the tray so that the prehybridization solution is 
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evenly distributed throughout the bag. Incubate the blot for at least 2 hours 
with gentle agitation using a waver shaker. 

2. Denature DIG-labeled DNA probe by incubating for 10 min. at 98°C using the 
PCR machine and immediately cool it to 4°C. 

3. Add probe to prehybridization solution (25 ng/ml; 30 ml = 750 ng total probe) 
and mix well but avoid foaming. Bubbles may lead to background. 

4. Pour off the prehybridization solution from the hybridization bags and add 
new prehybridization and probe solution mixture to the bags containing the 
membrane. 

5. Incubate with gentle agitation for at least 16 hours. 

6. Proceed to medium stringency post-hybridization wash: 

Three times for 20 min. each with gentle agitation using IX SSC, 1% SDS at 
60°C. 

All wash solutions must be prewarmed to 60°C. Use about 100 ml of wash 
solution per membrane. 

To avoid background keep the membranes fully submerged to avoid drying in 
spots; agitate sufficiently to avoid having membranes stick to one another. 

7. After the wash, proceed to immunological detection and CSPD development. 
E. Procedure for Immunological Detection with CSPD 



Solutions : 



Buffer 1: 



Maleic acid buffer (0.1 M maleic acid, 0.15 M 
NaCl; adjusted to pH 7.5 with NaoH) 



Washing buffer: 



Maleic acid buffer with 0.3% (v/v) Tween 20. 
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Blocking stock solution 



10% blocking reagent in buffer 1. Dissolve 



(10X concentration): blocking reagent powder 
(Boehringer Mannheim, Indianapolis, IN, cat. 
no. 1096176) by constantly stirring on a 65°C 



heating block or heat in a microwave, autoclave 
and store at 4°C. 



Buffer 2 



(IX blocking solution): 



Dilute the stock solution 1:10 in Buffer 1. 



Detection buffer: 



0.1 M Tris, 0.1 M NaCl, pH 9.5 



Procedure : 

1. After the post-hybridization wash the blots are briefly rinsed (1-5 min.) in the 
maleate washing buffer with gentle shaking. 

2. Then the membranes are incubated for 30 min. in Buffer 2 with gentle 
shaking. 

3. Anti-DIG-AP conjugate (Boehringer Mannheim, Indianapolis, IN, cat. no. 
1093274) at 75 mU/ml (1:10,000) in Buffer 2 is used for detection. 75 ml of 
solution can be used for 3 blots. 

4. The membrane is incubated for 30 min. in the antibody solution with gentle 
shaking. 

5. The membrane are washed twice in washing buffer with gentle shaking. 
About 250 mis is used per wash for 3 blots. 

6. The blots are equilibrated for 2-5 min in 60 ml detection buffer. 

7. Dilute CSPD (1:200) in detection buffer. (This can be prepared ahead of time 
and stored in the dark at 4°C). 

The following steps must be done individually. Bags (one for detection and 
one for exposure) are generally cut and ready before doing the following steps. 

8. The blot is carefully removed from the detection buffer and excess liquid 
removed without drying the membrane. The blot is immediately placed in a 
bag and 1.5 ml of CSPD solution is added. The CSPD solution can be spread 
over the membrane. Bubbles present at the edge and on the surface of the blot 
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are typically removed by gentle rubbing. The membrane is incubated for 5 
min. in CSPD solution. 

9. Excess liquid is removed and the membrane is blotted briefly (DNA side up) 
on Whatman 3 MM paper. Do not let the membrane dry completely. 

10. Seal the damp membrane in a hybridization bag and incubate for 10 min at 
37°C to enhance the luminescent reaction. 

11. Expose for 2 hours at room temperature to X-ray film. Multiple exposures can 
be taken. Luminescence continues for at least 24 hours and signal intensity 
increases during the first hours. 

Example 3: Transformation of Carrot Cells 

Transformation of plant cells can be accomplished by a number of methods, as 
described above. Similarly, a number of plant genera can be regenerated from tissue 
culture following transformation. Transformation and regeneration of carrot cells as 
described herein is illustrative. 

Single cell suspension cultures of carrot (Daucus carota) cells are established 
from hypocotyls of cultivar Early Nantes in B 5 growth medium (O.L. Gamborg et al., 
Plant Physiol 45:372 (1970)) plus 2,4-D and 15 mM CaCl 2 (B 5 -44 medium) by 
methods known in the art. The suspension cultures are subcultured by adding 10 ml 
of the suspension culture to 40 ml of B 5 -44 medium in 250 ml flasks every 7 days and 
are maintained in a shaker at 150 rpm at 27 °C in the dark. 

The suspension culture cells are transformed with exogenous DNA as 
described by Z. Chen et al. Plant MoL Bio. 36:163 (1998). Briefly, 4-days post- 
subculture cells are incubated with cell wall digestion solution containing 0.4 M 
sorbitol, 2% driselase, 5mM MES (2-[N-Morpholino] ethanesulfonic acid) pH 5.0 for 
5 hours. The digested cells are pelleted gently at 60 xg for 5 min. and washed twice 
in W5 solution containing 154 mM NaCl, 5 mM KC1, 125 mM CaCl 2 and 5mM 
glucose, pH 6.0. The protoplasts are suspended in MC solution containing 5 mM 
MES, 20 mM CaCl 2 , 0.5 M mannitol, pH 5.7 and the protoplast density is adjusted to 
about 4 x 10 6 protoplasts per ml. 

15-60 |ag of plasmid DNA is mixed with 0.9 ml of protoplasts. The resulting 
suspension is mixed with 40% polyethylene glycol (MW 8000, PEG 8000), by gentle 
inversion a few times at room temperature for 5 to 25 min. Protoplast culture medium 
known in the art is added into the PEG-DNA-protoplast mixture. Protoplasts are 
incubated in the culture medium for 24 hour to 5 days and cell extracts can be used 
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for assay of transient expression of the introduced gene. Alternatively, transformed 
cells can be used to produce transgenic callus, which in turn can be used to produce 
transgenic plants, by methods known in the art. See, for example, Nomura and 
Komamine, Pit, Phys. 79:988-991 (1985), Identification and Isolation of Single Cells 
that Produce Somatic Embryos in Carrot Suspension Cultures, 

A deposit of an E, coli Library, E. cc?//LibA021800, was made at the American 
Type Culture Collection in Manassas, Virginia, USA on February 23, 2000 to meet 
the requirements of Budapest Treaty for the international recognition of the deposit of 
microorganisms. This deposit was assigned accession number PTA-1411. 

A deposit of an E. coli Library, E. co/*Lib060700, was made at the American 
Type Culture Collection in Manassas, Virginia, USA on June 8, 2000 to meet the 
requirements of Budapest Treaty for the international recognition of the deposit of 
microorganisms. This deposit was assigned accession number PTA-2007. 

The invention being thus described, it will be apparent to one of ordinary skill 
in the art that various modifications of the materials and methods for practicing the 
invention can be made. Such modifications are to be considered within the scope of 
the invention as defined by the following claims. 

Each of the references from the patent and periodical literature cited herein is 
hereby expressly incorporated in its entirety by such citation. 



